search-engine

It is capable of answering single-word search queries efficiently. The user needs to give a seed page url as input. The seed page( A good seed page is the one with many links embedded in it) can be any valid url which is used by the program to crawl the web. Given the seed page url, the program takes a couple of minutes (usually 15-20 min) depending upon the seed page, to fetch and preprocess(ranking) the data crawled.

On calculating all the results, user can make infinite number of single-word queries and the program displays the top results based on the search. The very famous Page Rank algorithm has been used in the program. The results are sorted in decreasing order of their ranks computed with the ranks displayed alongside.

To clone the repository

$ git clone https://github.com/laxmanverma/search-engine.git

to run the script

$ python search_engine.py

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
README.md		README.md
search_engine.py		search_engine.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

search-engine

About

Releases

Packages

Languages

laxmanverma/search-engine

Folders and files

Latest commit

History

Repository files navigation

search-engine

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages