Data Extraction From Wikis
Use of ontologies for the extraction of structured data from wikis
Short Description: | Development of a Java application for the extraction of data from wikis and their reorganization inside an ontology |
Coordinator: | |
Tutor: | DavideEynard (eynard@elet.polimi.it) |
Collaborator: | |
Students: | CarloMiglierina (carlo.miglierina@gmail.com) |
Research Area: | Social Software and Semantic Web |
Research Topic: | |
Start: | 2008/10/28 |
End: | 2009/09/05 |
Status: | Closed |
Level: | Bs |
Type: | Thesis |
Part 1: project profile
Project name
Use of ontologies for the extraction of structured data from wikis
Project short description
Wikipedia is the largest and best-known example of a wiki. Wikis built on the same technology hold a great deal of information, created and edited by many users. These free encyclopedias have a drawback, however: their data is not structured, so it cannot be queried with advanced searches or processed automatically by computers. The aim of this project is to create a Java application that extracts semi-structured data from wiki templates and infoboxes and stores it in an ontology, yielding structured data. With the ontology, advanced searches become possible because computers can process the data. As an example, the application has been used to organize data about the characters of "The Lord of the Rings".
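The extraction step described above can be illustrated with a minimal sketch. It is not the project's actual code: the class name InfoboxSketch, the namespace http://example.org/lotr# and the sample infobox text are all assumptions made for illustration. The sketch assumes one "| key = value" parameter per line, with no nested templates or wiki links inside values, and writes the result as N-Triples statements that an ontology tool could load.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class InfoboxSketch {

    // Hypothetical namespace for the generated resources and properties.
    static final String NS = "http://example.org/lotr#";

    /** Parses the "| key = value" lines of one template call into an ordered map. */
    static Map<String, String> parseInfobox(String wikitext) {
        Map<String, String> fields = new LinkedHashMap<>();
        for (String line : wikitext.split("\n")) {
            String trimmed = line.trim();
            if (!trimmed.startsWith("|")) {
                continue; // skips the "{{Infobox ..." opener and the closing "}}"
            }
            int eq = trimmed.indexOf('=');
            if (eq < 0) {
                continue; // positional parameters are ignored in this sketch
            }
            String key = trimmed.substring(1, eq).trim();
            String value = trimmed.substring(eq + 1).trim();
            if (!key.isEmpty() && !value.isEmpty()) {
                fields.put(key, value);
            }
        }
        return fields;
    }

    /** Serializes the fields as N-Triples, one literal-valued statement per field. */
    static String toNTriples(String subjectName, Map<String, String> fields) {
        StringBuilder sb = new StringBuilder();
        String subject = "<" + NS + localName(subjectName) + ">";
        for (Map.Entry<String, String> e : fields.entrySet()) {
            sb.append(subject)
              .append(" <").append(NS).append(localName(e.getKey())).append("> \"")
              .append(escape(e.getValue())).append("\" .\n");
        }
        return sb.toString();
    }

    /** Turns free text into a safe IRI local name by replacing other characters with "_". */
    static String localName(String s) {
        return s.trim().replaceAll("[^A-Za-z0-9]+", "_");
    }

    /** Escapes backslashes and double quotes for an N-Triples string literal. */
    static String escape(String s) {
        return s.replace("\\", "\\\\").replace("\"", "\\\"");
    }

    public static void main(String[] args) {
        // Made-up sample input in the style of a character infobox.
        String page = "{{Infobox character\n| name = Frodo Baggins\n| race = Hobbit\n}}";
        Map<String, String> fields = parseInfobox(page);
        System.out.print(toNTriples(fields.get("name"), fields));
    }
}
```

Every value is emitted as a plain string literal here. A fuller pipeline would presumably map some fields, such as a character's race, to ontology classes or object properties rather than literals, but that mapping is outside the scope of this sketch.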
Dates
Start date: 2008/10/28
End date: 2009/09/05
People involved
Project Advisor
Students
Students currently working on the project