Skip to content

This is implementation of a tiny search engine in Python.

Notifications You must be signed in to change notification settings

Ankit-rana/tiny-search-engine

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tiny-search-engine

This is implementation of a tiny search engine in Python.

This project contains 2 branches:

master branch: synchronous code

async_version branch: asyncio code

This project have 3 parts:

Universal Crawler

This module is responsible for taking baseURL as input and crawling to other URL's associated with the baseURL until the given depth. it then returns the crawled websites data in the form of html file stored in the destination provided by the user.

Indexer

This module is responsible for indexing the crawled data for the query engine to search and return the desired result.

Query Engine

This module is responsible to take the search string as an input,look up in the indexer to return the document id and URL in descending order based on their occurance.

Steps to execute the modules:

Crawler sudo python crawler.py <baseURL> <Destination> <Depth>

Indexer sudo python indexer.py <target_directory> <Output_filename>

Query Engine sudo python queryEngine.py <Enter string to be searched>

About

This is implementation of a tiny search engine in Python.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%