Campus Seach Engine

About

Building a local query-based search engine to assist searching and indexing over local area networks. Custom searching and results display.

Crawling: Using Scrapy, BeautifulSoup to scrap Internet and Intranet websites of IIT Guwahati relevant to the campus and preparing dataset to be indexed
Indexing: Using Whoosh to index the scraped data, based on schemas defined namely TITLE, PATH and CONTENT
Ranking: Page ranking algorithms to be implemented to implement relevant search results

Dependencies

Scrapy
BeautifulSoup
Whoosh
Django

Code Files

datablogger.py: running the scrapy script for scraping www.iitg.ac.in
whoosh.py: indexing the scraped data according to schemas created
search.py: running searches on the indexed data

Project Deployment

Django based back-end used to deploy the Search engine in the form of a website with comaptible search results on local IITG websites. Project is currently being developed under Students' Web Committee (SWC) IIT Guwahati and has a expansive scope.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
Indexing		Indexing
Scraping		Scraping
README.md		README.md
dataset.xlsx		dataset.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Campus Seach Engine

About

Dependencies

Code Files

Project Deployment

About

Releases

Packages

Languages

maneshwarS/Campus-Search-Engine

Folders and files

Latest commit

History

Repository files navigation

Campus Seach Engine

About

Dependencies

Code Files

Project Deployment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages