Named-entity recognition (NER) aims to identify named entities (people, places, things and so on) in a text.
You can submit your document and get a list of the people, places or things that are mentioned in it.
This can be useful in many information processing tasks. For example, when researching the EU you could scrape the web and run keyword searches to find all the pages that mention "Ursula von der Leyen" or, if you have a taste for controversy, "Donald Trump". It can also help in monitoring media interest in individuals or places that are currently in the news. In machine translation, it can be used to identify individuals or places and ensure that the official or preferred translations are used.
The system generates a marked-up list; what you then do with it is limited only by your imagination!
The CEF named-entity recognition service expects a Base64-encoded file as input and returns a tagged XML file as output.
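As a minimal sketch of preparing the input, the following Python snippet Base64-encodes a document's bytes using the standard library. The function name `encode_document` and the sample text are hypothetical; how the encoded string is wrapped in the actual request is not described here and is not shown.

```python
import base64


def encode_document(data: bytes) -> str:
    """Return the Base64 representation of a document as an ASCII string."""
    return base64.b64encode(data).decode("ascii")


# Illustrative sample content; in practice these bytes would be read from
# the file to be analysed, e.g. with open(path, "rb").read().
sample = b"Ursula von der Leyen visited Oslo."
encoded = encode_document(sample)
print(encoded)
```

Decoding the string with `base64.b64decode` returns the original bytes, which is a quick way to check the encoding step before submitting a document.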
The CEF named-entity recognition web service is asynchronous. This means that the client sends a NER request and is notified once the document has been processed. In this way, calling the web service does not block the client. However, the client needs to expose a callback URL which will receive a notification that the named-entity extraction has been completed. The NER web service sends the tagged XML file to the destination URL specified by the client.
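To illustrate what exposing a callback URL can look like, here is a minimal sketch of a callback endpoint built with Python's standard `http.server`. Everything specific in it is an assumption made for illustration: the path `/ner-callback`, the use of HTTP POST, and the placeholder XML payload are hypothetical and do not describe the real service's notification format. The last lines simulate the NER server by posting to the endpoint locally.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

received = []  # payloads delivered to the callback endpoint


class CallbackHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the notification body and store it for later processing.
        length = int(self.headers.get("Content-Length", 0))
        received.append(self.rfile.read(length).decode("utf-8"))
        self.send_response(200)
        self.send_header("Content-Length", "0")
        self.end_headers()

    def log_message(self, format, *args):
        pass  # suppress the default per-request logging


def start_callback_server(port: int = 0) -> HTTPServer:
    """Start the endpoint in a background thread; port 0 picks a free port."""
    server = HTTPServer(("127.0.0.1", port), CallbackHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


server = start_callback_server()
port = server.server_address[1]

# Simulated notification (placeholder payload, not the real output format).
conn = http.client.HTTPConnection("127.0.0.1", port)
conn.request("POST", "/ner-callback", body=b"<tagged-document/>",
             headers={"Content-Type": "application/xml"})
status = conn.getresponse().status
conn.close()

server.shutdown()
server.server_close()
```

Because the endpoint runs in its own thread, the client remains free to do other work while waiting, which is the point of the asynchronous design. A real deployment would need a publicly reachable URL rather than `127.0.0.1`.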
The interaction is as follows:
The above approach decouples the client from the NER server. It is the responsibility of the client to submit the initial request and to react to the callback from the NER server. The request ID returned can be used to correlate the original request with the callback it refers to.
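The correlation step can be sketched as a small in-memory registry on the client side. This is only an illustration under stated assumptions: the class `PendingRequests`, its method names, and the example request ID and file name are all hypothetical, and the way the real callback carries the request ID is not specified here.

```python
class PendingRequests:
    """Tracks submitted documents until their callbacks arrive."""

    def __init__(self):
        self._pending = {}   # request ID -> document name, awaiting callback
        self.results = {}    # document name -> tagged XML, once received

    def register(self, request_id: str, document_name: str) -> None:
        # Call this with the request ID returned when the request is submitted.
        self._pending[request_id] = document_name

    def complete(self, request_id: str, tagged_xml: str) -> str:
        # Call this from the callback handler; raises KeyError for unknown IDs.
        document_name = self._pending.pop(request_id)
        self.results[document_name] = tagged_xml
        return document_name

    def outstanding(self) -> int:
        return len(self._pending)


tracker = PendingRequests()
tracker.register("req-001", "press_release.docx")            # hypothetical ID
name = tracker.complete("req-001", "<tagged-document/>")     # placeholder XML
print(name, tracker.outstanding())
```

Raising `KeyError` for an unknown request ID is a deliberate choice in this sketch: a callback that matches no submitted request is more likely an error than something to accept silently.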
To obtain the credentials needed to access the service:
National public administrations in the EU Member States, Iceland and Norway
: Free of charge
Helpdesk:
Interaction Language: English (Latin)
Request - Web Service : https://language-tools.ec.europa.eu/NamedEntitiesWS/askNER
| | Input | Output |
|---|---|---|
| Media Type | | |
| Data format | HTML, MS-Excel xls, MS-Excel xlsx, MS-Word doc, MS-Word docx, PDF, RTF, Text, XML | XML |
| Data format (other) | odt, ods, odp, odg, ott, ots, otp, otg, ppt, ppts | |