-
Hi @chrisspen - thanks for asking. I don't think Stork will work for you today. You're correct that indexes cannot be added to incrementally, and cannot be sliced/sharded/split. I'm looking into solving for both of these use cases, so it's not out of the question that Stork would be the right solution for you in the future, but I don't think you'll get what you're looking for from Stork today. Out of curiosity, if you're comfortable sharing, what are you working on that needs full-text search of 1M text articles? Best, James
-
Thanks for the overview. I have an application that generates transcripts for audio that I publish, and I'm looking for an inexpensive way to make them searchable online without using a dedicated FTS server. I'm currently using a very rudimentary distributed static index that maps keywords to transcript IDs, so it's not a proper tf-idf index, but it's better than nothing.
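For illustration, here is a rough Python sketch of the difference between a plain keyword-to-ID map and a tf-idf ranking. It is not the code from the application described above; the function names, the whitespace tokenizer, and the simple `tf * log(N / df)` weighting are all assumptions chosen to keep the example short.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Naive tokenizer (assumption): lowercase and split on whitespace.
    return text.lower().split()


def build_keyword_index(docs):
    """Map each keyword to the set of transcript IDs containing it (no ranking)."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def tfidf_scores(docs, query):
    """Score documents for a query using a basic tf * log(N / df) weighting."""
    n_docs = len(docs)
    term_counts = {doc_id: Counter(tokenize(text)) for doc_id, text in docs.items()}

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())

    scores = defaultdict(float)
    for term in tokenize(query):
        if doc_freq[term] == 0:
            continue  # term appears nowhere, contributes nothing
        idf = math.log(n_docs / doc_freq[term])
        for doc_id, counts in term_counts.items():
            if counts[term]:
                tf = counts[term] / sum(counts.values())
                scores[doc_id] += tf * idf
    return dict(scores)
```

The keyword map can only say which transcripts contain a term, while the tf-idf scores also give an ordering, at the cost of storing per-document term counts.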
-
I hope it's not rude to share an alternative (competitor) product, but consider Pagefind, which breaks up the entire index into tiny chunks (~30k) that are swiftly downloaded as the user searches.
-
Hi, I'm looking for a solution to make about 1 million text articles searchable on the client side. To save costs, I can't host an FTS server, and due to the number of documents, I can't load a large index in the client, nor even regenerate the index from scratch when it's updated on a daily basis.
Would Stork be a possible solution?
From the docs on building an index, it looks like it doesn't support incremental index creation, forcing you to rebuild the entire index after any minor data update. Is this correct?
Also, it looks like there's no support for splitting the index into smaller slices for lazy loading. For example, if someone searches for "dog", it doesn't make sense to load the entire index containing all search terms. It should only load the parts of the index that include the keyword "dog". Does Stork also not support this functionality?
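A minimal sketch of the kind of slicing described here, in Python. This is not Stork's API or index format. The shard count, the SHA-1 bucketing, the JSON payloads, and the `fetch_shard` callback (standing in for an HTTP request to a static file) are all hypothetical choices made for illustration.

```python
import hashlib
import json
from collections import defaultdict

NUM_SHARDS = 256  # hypothetical; tune so each shard stays small


def shard_for(term, num_shards=NUM_SHARDS):
    # Stable hash so the build step and the client agree on the shard.
    digest = hashlib.sha1(term.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards


def build_shards(docs, num_shards=NUM_SHARDS):
    """Return {shard_id: JSON string}, each mapping term -> list of doc IDs."""
    shards = defaultdict(lambda: defaultdict(list))
    for doc_id, text in sorted(docs.items()):
        for term in sorted(set(text.lower().split())):
            shards[shard_for(term, num_shards)][term].append(doc_id)
    return {sid: json.dumps(terms) for sid, terms in shards.items()}


def search(term, fetch_shard, num_shards=NUM_SHARDS):
    """Look up one term, fetching only the shard that could contain it."""
    term = term.lower()
    payload = fetch_shard(shard_for(term, num_shards))
    if payload is None:
        return []  # shard does not exist, so the term is not indexed
    return json.loads(payload).get(term, [])
```

With this layout a search for "dog" downloads one shard rather than the whole index, and a daily update only needs to rewrite the shards whose terms appear in the changed documents.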