“A FRAMEWORK FOR INTEGRATING NATURAL LANGUAGE PROCESSING TOOLS”
João de Almeida Varela Graça,
Departamento de Engenharia Informática –
Abstract:
Natural Language processing (NLP) systems are typically characterized by a pipeline architecture, in which several independently develop NLP tools connected as a chain of filters apply successive transformations to the data that flows through the system. Hence when integrating such tools, one may face problems that lead to information loss, such as: i) tools discard information from their input which is required by other tools; ii) each tool has its own input/output format;
This work proposes a solution to these problems, by using a client server architecture, where the server acts as a blackboard where all tools add/consult the data. The data is kept in the repository under a conceptual model independent of the client tools, which allows the representation of a broad range of linguistic information.
The tools interact with the repository through a generic remote interface which allows the creation of new data and the navigation through all the existing data. Moreover, this work provides libraries implemented in several programming language that abstract the connection and communication protocol details between the NLP tools and the server, and provide several levels of functionality that simplify the creation of NLP tools.
Keywords: Natural Language processing systems, Natural Language tools integration, Repository, Linguistic Annotation, Data lineage, Information loss.
Date: 2006-Apr-26 Time: 14:30:00 Room: ANFITEATRO PA-3 DO EDÍFICIO DE PÓS-GRADUAÇÃO DO IST
For more information:
- susana.costa@inesc.pt
- 213100338
Upcoming Events
INESC Brussels HUB Winter Meeting 2023

This edition of the HUB Winter Meeting will be co-organised with Science Business and will take place on the 30 and 31 January, in Lisbon, at Instituto Superior Técnico, Department of Computer Science and Engineering.
Please see below a summary of the agenda, this will be updated on the INESC Brussels HUB website regularly (confirmed speakers and other relevant info). Places for onsite participation are limited so registration is mandatory. Online participants will be sent a ZOOM link for each specific session on the 27th January.
INESC Brussels HUB website: https://hub.inesc.pt/
Monday, 30 January
a) Digital Europe Programme & Chips Act: state of play and possibilities for INESC.
9h to 10h30 GMT
(Exclusive for INESC researchers and administrators).
b) Science Business: how can INESC tap into Science Business network, activities and communications tools.
(Exclusive for INESC researchers and administrators).
c) Networking Lunch (for all onsite participants).
d) Roundtable: From rhetoric to reality – Embedding international strategy in the DNA of research organisations.
(Closed-door, roundtable workshop, Chatham House rules, open to INESC researchers and administrators, external participants by invitation only).
e) Networking Dinner
(By invitation only – INESC researchers participating onsite in the event are elegible to join).
Tuesday, 31 January
f) Workshop: How they did it? Strategic positioning for structural success in Horizon Europe: a discussion of best practices.
(Exclusive for INESC researchers, administrators and international invited speakers).
g) The public consultation on European R&I Programmes: Towards FP10.
(Closed-door, roundtable workshop, Chatham House rules, open to INESC researchers and administrators, external participants by invitation only).
h) Networking Lunch (for all onsite participants).
i) Management Committee meeting (Directors and POB members)
The HUB Winter Meeting aims at bringing together researchers and administrators from the 5 INESC institutes, affiliated higher education institutions in Portugal and abroad, with key European and global players, to:
– Discuss key research and innovation issues at EU level.
– Inform institutional policy and strategy.
– Exchange best-practices about R&I management, career development and policy positioning.
– Promote, discuss and deliver vision, visibility, networking and impactful communication.
– Create, identify and deepen partnerships and collaboration opportunities for collaborative R&I.