Difference between revisions of "Extending a search engine with semantic information"
DavidLaniado (Talk | contribs) (New page: {{ProjectProposal |title=Extending a search engine with semantic information |image= |description=We are used to keyword-based search engines, where only documents matching the exact words...) |
DavidLaniado (Talk | contribs) |
||
(5 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
{{ProjectProposal | {{ProjectProposal | ||
|title=Extending a search engine with semantic information | |title=Extending a search engine with semantic information | ||
− | |image= | + | |image=velociraptor.png |
− | |description=We are used to keyword-based search engines, where only documents matching the exact words in the query are retrieved. | + | |description=We are used to keyword-based search engines, where only documents matching the exact words in the query are retrieved. In a traditional search engine, if you submit the query "a dinosaur in a university in Lombardy" you won't probably find a document containing the phrase "a velociraptor in Politecnico di Milano", even though it's more or less what you were looking for. |
− | + | ||
− | + | ||
+ | Aim of this project is to expand a traditional search engine with semantic information, so that also documents containing words related to the ones in the query can be retrieved. Existing thesauri and ontologies can be used, as well as more dynamic and collaborative sources of knowledge such as user tags and wikipedia pages and categories. | ||
+ | |||
Starting points for this work can be the projects "SeQuEx - Semantic Query Expansion" and "Enriching search results with semantic metadata". | Starting points for this work can be the projects "SeQuEx - Semantic Query Expansion" and "Enriching search results with semantic metadata". | ||
|tutor=DavidLaniado;MarcoColombetti | |tutor=DavidLaniado;MarcoColombetti | ||
+ | |studmin=1 | ||
+ | |studmax=2 | ||
|cfumin=5 | |cfumin=5 | ||
|cfumax=20 | |cfumax=20 | ||
− | |||
− | |||
|resarea=Social Software and Semantic Web | |resarea=Social Software and Semantic Web | ||
|restopic=Semantic Search | |restopic=Semantic Search | ||
− | |level=Bs;Ms | + | |level=Bs; Ms |
− | |type=Course;Thesis | + | |type=Course; Thesis |
− | |status= | + | |status=Closed |
}} | }} | ||
+ | We are used to keyword-based search engines, where only documents matching the exact words in the query are retrieved. In a traditional search engine, if you submit the query "a dinosaur in a university in Lombardy" you won't probably find a document containing the phrase "a velociraptor in Politecnico di Milano", even though it's more or less what you were looking for. | ||
+ | Aim of this project is to expand a traditional search engine with semantic information, so that also documents containing words related to the ones in the query can be retrieved. Existing thesauri and ontologies can be used, as well as more dynamic and collaborative sources of knowledge such as user tags and wikipedia pages and categories. | ||
− | + | Starting points for this work can be the projects "SeQuEx - Semantic Query Expansion" and "Enriching search results with semantic metadata". | |
− | + | ||
− | + | ||
− | + | ||
− | Starting points for this work can be the projects | + | |
;Tools and instruments | ;Tools and instruments | ||
:the project should be preferibly implemented in Java, extending [http://lucene.apache.org/solr/ Solr], an open source enterprise search server based on the [http://lucene.apache.org/ Lucene Java search library]. | :the project should be preferibly implemented in Java, extending [http://lucene.apache.org/solr/ Solr], an open source enterprise search server based on the [http://lucene.apache.org/ Lucene Java search library]. | ||
+ | |||
;Related projects | ;Related projects | ||
:[[SeQuEx | SeQuEx - Semantic Query Expansion]] | :[[SeQuEx | SeQuEx - Semantic Query Expansion]] | ||
:[[Enriching search results with semantic metadata]] | :[[Enriching search results with semantic metadata]] |
Latest revision as of 09:25, 27 May 2010
Title: | Extending a search engine with semantic information |
Image:velociraptor.png |
Description: | We are used to keyword-based search engines, where only documents matching the exact words in the query are retrieved. In a traditional search engine, if you submit the query "a dinosaur in a university in Lombardy" you won't probably find a document containing the phrase "a velociraptor in Politecnico di Milano", even though it's more or less what you were looking for.
Aim of this project is to expand a traditional search engine with semantic information, so that also documents containing words related to the ones in the query can be retrieved. Existing thesauri and ontologies can be used, as well as more dynamic and collaborative sources of knowledge such as user tags and wikipedia pages and categories. Starting points for this work can be the projects "SeQuEx - Semantic Query Expansion" and "Enriching search results with semantic metadata". | |
Tutor: | DavidLaniado (david.laniado@gmail.com), MarcoColombetti (colombet@elet.polimi.it) | |
Start: | Nowwarning.pngThe date "Now" was not understood. | |
Students: | 1 - 2 | |
CFU: | 5 - 20 | |
Research Area: | Social Software and Semantic Web | |
Research Topic: | Semantic Search | |
Level: | Bs, Ms | |
Type: | Course, Thesis | |
Status: | Closed |
We are used to keyword-based search engines, where only documents matching the exact words in the query are retrieved. In a traditional search engine, if you submit the query "a dinosaur in a university in Lombardy" you won't probably find a document containing the phrase "a velociraptor in Politecnico di Milano", even though it's more or less what you were looking for.
Aim of this project is to expand a traditional search engine with semantic information, so that also documents containing words related to the ones in the query can be retrieved. Existing thesauri and ontologies can be used, as well as more dynamic and collaborative sources of knowledge such as user tags and wikipedia pages and categories.
Starting points for this work can be the projects "SeQuEx - Semantic Query Expansion" and "Enriching search results with semantic metadata".
- Tools and instruments
- the project should be preferibly implemented in Java, extending Solr, an open source enterprise search server based on the Lucene Java search library.