Skip to content

NoobPeen/GO-text-search-engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GO-text-search-engine

get the wikipedia dump folder from HERE

GO-text-Search-Engine

Overview

This Go project is designed to efficiently preprocess, tokenize, and reverse-index Wikipedia dump data, enabling fast and relevant search capabilities. By leveraging advanced text processing techniques and optimized data structures, this project provides an effective solution for querying large datasets in real-time.

Features

  • Preprocessing and Tokenizing: Processes the Wikipedia dump to clean and tokenize the text, preparing it for indexing.
  • Reverse Indexing: Implements a reverse index for quick lookup of tokens, enhancing search performance.
  • Search Query Interface: Offers a command-line interface that accepts user input for search queries and retrieves relevant results efficiently
To run this project

1)Clone the github repo

git clone https://github.com/NoobPeen/GO-text-search-engine.git

2)Go the directory of the project and run the following command

go build

3)Usage

go run main.go -q "search query"

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages