Add IndexInput isLoaded #13998

ChrisHegarty · 2024-11-15T15:07:33Z

This commit adds IndexInput::isLoaded to help determine if the contents of an input is resident in physical memory.

The intent of this new method is to help build inspection and diagnostic infrastructure on top. The initial requirement is to help understand if vector data and more specifically the HNSW graph are in memory. For search use cases, performance drops off significantly if, at least, the graph is not resident. This is not a perfect API, more of a hint, but along with read advice like MADV_WILLNEED may be used to determine perf issues searching vectors

navneet1v · 2024-11-15T18:32:24Z

@ChrisHegarty this will be a very useful thing. Can we also figure out how much data is loaded with this API? So lets say an IndexInput is 30GB and only 10GB is loaded/mapped in memory can return that too?

ChrisHegarty · 2024-11-18T09:34:06Z

@ChrisHegarty this will be a very useful thing.

Indeed.

Can we also figure out how much data is loaded with this API? So lets say an IndexInput is 30GB and only 10GB is loaded/mapped in memory can return that too?

While possible, it's not straightforward and would require some native access. For now, let's go with the basic loaded / not-loaded, since this is useful as is.

rmuir · 2024-11-18T14:33:54Z

You would need to call mincore or something yourself. I can't remember, but the native access may already be plumbed.

for non-mmapped i/o you can do similar with syscalls such as cachestat but you need modern linux kernel for that.

lucene/core/src/java21/org/apache/lucene/store/MemorySegmentIndexInput.java

ChrisHegarty · 2024-11-18T14:54:48Z

Yeah, we can look at how to call mincore, and it might not be that much of a lift with the existing plumbing. Maybe something can look at as a follow up? I'm really trying to get to a situation where we can load (MADV_WILLNEED), and check even the HNSW graph. Maybe even mlock, as a potential follow up. Since not having the graph in memory results in horrible perf (need to get some numbers).

rmuir · 2024-11-18T15:08:47Z

Yeah, we can look at how to call mincore, and it might not be that much of a lift with the existing plumbing. Maybe something can look at as a follow up? I'm really trying to get to a situation where we can load (MADV_WILLNEED), and check even the HNSW graph. Maybe even mlock, as a potential follow up. Since not having the graph in memory results in horrible perf (need to get some numbers).

yes, agreed about mincore as a followup. Let's use existing JDK plumbing as a start as done here.

i'm very much against using mlock, there are so many problems with this. With an out of box linux system my ulimit for this is set to 8MB. I really don't think we should be mlocking gigabytes of vectors because the access is inefficient. It would be better to improve documentation, so that users avoid the typical mistakes such as setting too-big java heap (leaving no room for buffers/cache), configure swappiness if needed, etc. mlock will just make problems worse.

rmuir · 2024-11-18T15:12:47Z

Also for debugging these issues, you can get this information at non-java level using fincore from util-linux, which is probably on any machine:

myindexdir$ fincore --output-all *
PAGES  SIZE FILE               RES DIRTY_PAGES DIRTY WRITEBACK_PAGES WRITEBACK EVICTED_PAGES EVICTED RECENTLY_EVICTED_PAGES RECENTLY_EVICTED
...

ChrisHegarty · 2024-11-19T09:26:42Z

yes, agreed about mincore as a followup. Let's use existing JDK plumbing as a start as done here.

++

i'm very much against using mlock, there are so many problems with this. With an out of box linux system my ulimit for this is set to 8MB. I really don't think we should be mlocking gigabytes of vectors because the access is inefficient. It would be better to improve documentation, so that users avoid the typical mistakes such as setting too-big java heap (leaving no room for buffers/cache), configure swappiness if needed, etc. mlock will just make problems worse.

Yeah, optionally being able to mlock might just cause more problems than it solves, I'll need to play a bit more with it.

For now, I mostly wanna be able to:

operationally indicate to load something into memory (MADV_WILLNEED), and
verify that something is loaded (this PR).

jpountz

This works for me. Maybe implement this API on our in-memory index inputs to return true, e.g. ByteBuffersIndexInput?

jpountz · 2024-11-20T12:19:41Z

lucene/core/src/java/org/apache/lucene/store/IndexInput.java

+   * hint because the operating system may have paged out some of the data by the time this method
+   * returns. If the optional is true, then it's likely that the contents of this input are resident
+   * in physical memory. A value of false does not imply that the contents are not resident in
+   * physical memory. An empty optional is returned if it is not possible to determine.


It looks like an empty optional and false mostly mean the same thing, which makes me wonder if this should return a boolean directly?

It may also be worth pointing out that this method runs in linear time with the amount of data that this IndexInput exposes (as opposed to constant-time). So it makes little sense to use it to do something like "if (isLoaded() == false) { prefetch(); }"

I added a note about the time complexity. I'd like to keep the tri-state of the return type, at least for now. Since I think will be useful to know that the isLoaded-ness or not, is determinable or not.

lucene/core/src/java21/org/apache/lucene/store/MemorySegmentIndexInput.java

ChrisHegarty · 2024-11-20T16:45:33Z

This works for me. Maybe implement this API on our in-memory index inputs to return true, e.g. ByteBuffersIndexInput?

yeah, I think that this prob makes sense. Lemme satisfy myself that it will always be true.

rmuir · 2024-11-22T15:09:28Z

yeah, I think that this prob makes sense. Lemme satisfy myself that it will always be true.

it won't be in core if currently swapped out, no? I don't think a hardcoded true works.

ChrisHegarty · 2024-11-22T15:57:52Z

yeah, I think that this prob makes sense. Lemme satisfy myself that it will always be true.

it won't be in core if currently swapped out, no? I don't think a hardcoded true works.

Yeah. I was taking a little time to consider if it might be worth casting to MappedByteBuffer (for off-heap) to check for residency in physical memory. But I'm not really sure it's worth the effort, given the primary type of introspection that this API will be used - to determine if the random access nature of searching may falloff a performance cliff in production like environments!

jpountz · 2024-11-22T16:54:12Z

Sorry for derailing the PR, let's not implement it on ByteBuffersIndexInput then. We can look into it in a separate PR if we want.

This commit adds IndexInput::isLoaded to help determine if the contents of an input is resident in physical memory. The intent of this new method is to help build inspection and diagnostic infrastructure on top.

Add IndexInput isLoaded

f3603f7

ChrisHegarty requested a review from jpountz November 15, 2024 15:07

itr

81f53de

fix test

601535c

avoid testing with direct IO

87c2b7b

rmuir reviewed Nov 18, 2024

View reviewed changes

lucene/core/src/java21/org/apache/lucene/store/MemorySegmentIndexInput.java Show resolved Hide resolved

early return

1f718c1

serial IO Counting test dir

f758970

jpountz reviewed Nov 20, 2024

View reviewed changes

ChrisHegarty added 2 commits November 20, 2024 16:31

add override anno

02f5b9a

javadoc linear time

3b68ad9

Merge branch 'main' into input_isLoaded

8fdf11f

jpountz approved these changes Nov 25, 2024

View reviewed changes

ChrisHegarty added 2 commits November 29, 2024 09:44

Merge branch 'main' into input_isLoaded

14591a4

add missing wrapper delegate

10fa036

ChrisHegarty merged commit 7dbbd0d into apache:main Nov 29, 2024
3 checks passed

ChrisHegarty deleted the input_isLoaded branch November 29, 2024 10:28

dweiss mentioned this pull request Dec 9, 2024

IndexInput.isLoaded seems to return false for mmap index inputs on Windows #14050

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add IndexInput isLoaded #13998

Add IndexInput isLoaded #13998

ChrisHegarty commented Nov 15, 2024

navneet1v commented Nov 15, 2024

ChrisHegarty commented Nov 18, 2024

rmuir commented Nov 18, 2024

ChrisHegarty commented Nov 18, 2024 •

edited

Loading

rmuir commented Nov 18, 2024

rmuir commented Nov 18, 2024

ChrisHegarty commented Nov 19, 2024

jpountz left a comment

jpountz Nov 20, 2024

ChrisHegarty Nov 20, 2024

ChrisHegarty commented Nov 20, 2024

rmuir commented Nov 22, 2024

ChrisHegarty commented Nov 22, 2024

jpountz commented Nov 22, 2024

Add IndexInput isLoaded #13998

Add IndexInput isLoaded #13998

Conversation

ChrisHegarty commented Nov 15, 2024

navneet1v commented Nov 15, 2024

ChrisHegarty commented Nov 18, 2024

rmuir commented Nov 18, 2024

ChrisHegarty commented Nov 18, 2024 • edited Loading

rmuir commented Nov 18, 2024

rmuir commented Nov 18, 2024

ChrisHegarty commented Nov 19, 2024

jpountz left a comment

Choose a reason for hiding this comment

jpountz Nov 20, 2024

Choose a reason for hiding this comment

ChrisHegarty Nov 20, 2024

Choose a reason for hiding this comment

ChrisHegarty commented Nov 20, 2024

rmuir commented Nov 22, 2024

ChrisHegarty commented Nov 22, 2024

jpountz commented Nov 22, 2024

ChrisHegarty commented Nov 18, 2024 •

edited

Loading