Project presentation
We are getting closer to the end of the course.
We will be conduct project presentations on December 6th.
You need to present the following
1. Crawling technique
2. Your index creation methodology
3. Sample queries (4-5)
Each team gets a 5 minutes to present and 5 minutes for questions.
You are required to implement a search engine for unt.edu.
implement vector space retrieval model for the search
Evaluation of the system:
Select a word of your choice
Run the query on the original unt.edu
Run the same query on your system
Compare the result and report any discrepancies
Please note: You need to crawl the unt.edu to collect webpages in unt.edu and parse them to get terms that may end up in your dictionary.
You can use any library to crawl and parse web pages or you can use your own custom built crawler/parser.
Submit a report explaining steps to run the search engine and sample results for set of search terms.
Do you know about the indexing and all
We need all of these 3
Need report as well 1500 words
Make sure we get all the crawling techniques indexing and all