Fails to add vectors to GpuIndexIVFPQ #440

ZhuoranLyu · 2018-05-09T09:27:23Z

Summary

Trying to train GpuIndexIVFPQ and add vectors to the index. Seg fault.

Platform

OS: Ubuntu 16.04

Faiss version:

Faiss compilation options:

Running on :

GPU Titan X 12GB

Reproduction instructions

Briefly, I was trying to search in 50 million 128D vectors. I used GpuIndexIVFPQ(PQ8) with a GTX Titan X with 12 GB memory. It crashes when I added the vectors to the index. However, it works well when I add 40 millions vectors. I checked the nvidia-smi and find that there is enough mem to use.

Here is the bt of gdb:
Program received signal SIGSEGV, Segmentation fault.
0x00007fffdc1e9b9c in ?? () from /usr/lib/x86_64-linux-gnu/libcuda.so.1
(gdb) bt
#0 0x00007fffdc1e9b9c in ?? () from /usr/lib/x86_64-linux-gnu/libcuda.so.1
#1 0x00007fffdc29610e in ?? () from /usr/lib/x86_64-linux-gnu/libcuda.so.1
#2 0x00007fffdc35edf9 in ?? () from /usr/lib/x86_64-linux-gnu/libcuda.so.1
#3 0x00007fffdc35f9c5 in ?? () from /usr/lib/x86_64-linux-gnu/libcuda.so.1
#4 0x00007fffdc297620 in ?? () from /usr/lib/x86_64-linux-gnu/libcuda.so.1
#5 0x00007fffdc1b93f8 in ?? () from /usr/lib/x86_64-linux-gnu/libcuda.so.1
#6 0x00007fffdc1ba910 in ?? () from /usr/lib/x86_64-linux-gnu/libcuda.so.1
#7 0x00007fffdc2fa8b2 in cuMemcpyHtoDAsync_v2 ()
from /usr/lib/x86_64-linux-gnu/libcuda.so.1
#8 0x00007ffff416e8cc in ?? ()
from /usr/local/cuda-8.0/lib64/libcudart.so.8.0
#9 0x00007ffff414ab5b in ?? ()
from /usr/local/cuda-8.0/lib64/libcudart.so.8.0
#10 0x00007ffff4184b08 in cudaMemcpyAsync ()
from /usr/local/cuda-8.0/lib64/libcudart.so.8.0
#11 0x0000000000435609 in faiss::gpu::Tensor<float, 2, true, int, faiss::gpu::traits::DefaultPtrTraits>::copyFrom(faiss::gpu::Tensor<float, 2, true, int, faiss::gpu::traits::DefaultPtrTraits>&, CUstream_st*) ()
#12 0x0000000000433a69 in faiss::gpu::DeviceTensor<float, 2, true, int, faiss::gpu::traits::DefaultPtrTraits> faiss::gpu::toDevice<float, 2>(faiss::gpu::GpuResources*, int, float*, CUstream_st*, std::initializer_list) ()
#13 0x000000000043de15 in faiss::gpu::GpuIndexIVFPQ::addImpl_(long, float const*, long const*) ()
#14 0x000000000043a9e2 in faiss::gpu::GpuIndex::addInternal_(long, float const*, long const*) ()
#15 0x000000000043a744 in faiss::gpu::GpuIndex::add_with_ids(long, float const*, long const*) ()
#16 0x000000000040c90b in CwAnnTopkImpl::add_with_batch_gpu (
this=0x7fffffffc080, vec_feats=0x7ff9d3128010, feat_num=50099900,
ids=0x7ff9b52e0010) at CwAnnTopkImpl.cpp:219
---Type to continue, or q to quit---
#17 0x000000000040daed in CwAnnTopkImpl::add_with_ids_cwfeat_gpu (
this=0x7fffffffc080, vec_feats=0x7ff9d3128010, feat_num=50099900,
feat_dim=128, ids=0x7ff9b52e0010) at CwAnnTopkImpl.cpp:560
#18 0x0000000000409b00 in main (argc=1, argv=0x7fffffffe118)
at test/test_cwimpl_testhitrate.cpp:143

Appreciate any help.

wickedfoo · 2018-05-15T17:55:39Z

@ZhuoranLyu can you use gdb to print out the locals and arguments to stack frame 11 above (the one with faiss::gpu::Tensor<float, 2, true, int, faiss::gpu::traits::DefaultPtrTraits>::copyFrom(faiss::gpu::Tensor<float, 2, true, int, faiss::gpu::traits::DefaultPtrTraits>&, CUstream_st*) ()?

wickedfoo · 2018-05-15T17:56:09Z

similarly, the arguments and locals to 0x000000000043a9e2 in faiss::gpu::GpuIndex::addInternal_(long, float const*, long const*) ()?

ZhuoranLyu · 2018-05-17T06:23:52Z

@wickedfoo I try to use info args to get the arguments of certain stack frame. However, it always says no symbol table info available. Any other ways to print out the locals?

ZhuoranLyu · 2018-05-17T06:49:22Z

figure it out. my bad.

mdouze added question GPU labels May 10, 2018

mdouze assigned wickedfoo May 10, 2018

ZhuoranLyu closed this as completed May 17, 2018

ZhuoranLyu reopened this May 17, 2018

ZhuoranLyu closed this as completed May 17, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fails to add vectors to GpuIndexIVFPQ #440

Fails to add vectors to GpuIndexIVFPQ #440

ZhuoranLyu commented May 9, 2018 •

edited

Loading

wickedfoo commented May 15, 2018

wickedfoo commented May 15, 2018

ZhuoranLyu commented May 17, 2018

ZhuoranLyu commented May 17, 2018

Fails to add vectors to GpuIndexIVFPQ #440

Fails to add vectors to GpuIndexIVFPQ #440

Comments

ZhuoranLyu commented May 9, 2018 • edited Loading

Summary

Platform

Reproduction instructions

wickedfoo commented May 15, 2018

wickedfoo commented May 15, 2018

ZhuoranLyu commented May 17, 2018

ZhuoranLyu commented May 17, 2018

ZhuoranLyu commented May 9, 2018 •

edited

Loading