CSAW: Curating and Searching the Annotated Web

Our ambition is to annotate mentions of named entities on billions of Web pages with entity IDs, thus linking them to entity nodes in Wikipedia. This will enable searching with entities and relationships at an unprecedented scale. The project has two parts: annotating token segments on Web pages with Wikipedia entity IDs, and developing a new aggregated search mechanism for quantities.

Papers and Talk Slides

Demo, poster, press, etc.

Data

Code

Related projects, services, products, links

Project members

(In approximate order of recency) Soumen Chakrabarti, Uma Sawant, Shashank Gupta, Siddhanth Jain, Hrushikesh Mohapatra, Sasidhar Kasturi, Devshree Sane, Ganesh Ramakrishnan, Apoorv Sharma, Amit Singh, Sayali Kulkarni, Somnath Banerjee.

Support

Partly supported by grants from Google, HP Labs, Yahoo, Microsoft Research, NetApp and SAP.