Skip to content

Latest commit

 

History

History
15 lines (11 loc) · 931 Bytes

README.md

File metadata and controls

15 lines (11 loc) · 931 Bytes

Entity Linking

After the entity mentions in the document are extracted, we linked these mentions with their named entities in the Knowledge Base. This phase can be divided into the following steps:

Query the Knowledge Base

If the mention is English words, input surface form of mentions to ElasticSearch to get 20 candidate named entities with the highest popularity score from FreeBase.

Rank the candidate named entities

We assign a score to each of the candidates, and rank them according to the score.

  • Check if the mention and the candidate is exactly the same. If so, set the score as Inf.
  • Calculate the cosine similarity of candidates' abstract and the context containing the mention.
  • We assign a new score for every candidate - the sum of the popularity score and the cosine similarity score.

Link the mention and the candidates

We select the candidate with the highest score to be the entity meaning.