S. Pado, T.-G. Noh, A. Stern, R. Wang, R. Zanoli: Design and Realization of a Modular Architecture for Textual Entailment. Natural Language Engineering 21(2), 167--200. 2015. doi:10.1017/S1351324913000351. Copyright Cambridge University Press, 2013.


A key challenge at the core of many NLP tasks is the ability to determine which conclusions can be inferred from a given natural language text. This problem, called the Recognition of Textual Entailment (RTE), has initiated the development of a range of algorithms, methods and technologies.

Unfortunately, research on TE (like semantics research more generally), is fragmented into studies focussing on various aspects of semantics such as world knowledge, lexical and syntactic relations, or more specialized kinds of inference. This fragmentation has problematic practical consequences. Notably, interoperability among existing RTE systems is poor, and reuse of resources and algorithms is mostly infeasible. This also makes systematic evaluations very difficult to carry out. Finally, TE presents a wide array of approaches to potential end users with little guidance on which to pick.

Our contribution to this situation is the novel EXCITEMENT architecture, which was developed to enable and encourage the consolidation of methods and resources in the TE area. It decomposes RTE into components with strongly typed interfaces. We specify (a) a modular linguistic analysis pipeline and (b) a decomposition of the ``core'' RTE methods into top-level algorithms and subcomponents. We identify four major subcomponent types, including knowledge bases and alignment methods. The architecture was developed with a focus on generality, supporting all major approaches to RTE and encouraging language independence. We illustrate the feasibility of the architecture by constructing mappings of major existing systems onto the architecture.

The practical implementation of this architecture forms the EXCITEMENT open platform (EOP). It is a suite of textual entailment algorithms and components which contains the three systems named above, including linguistic-analysis pipelines for three languages (English, German, and Italian), and comprises a number of linguistic resources. By addressing the problems outlined above, the platform provides a comprehensive and flexible basis for research and experimentation in Textual Entailment and is available as open source software under the GPL license.


@Article{pado13:_desig_realiz_modul_archit_textual_entail,
  author = 	 {Sebastian Pad\'o and Tae-Gil Noh and Asher Stern and 
                  Rui Wang and Roberto Zanoli},
  title = 	 {Design and Realization of a Modular Architecture for 
                  Textual Entailment},
  journal = 	 {Journal of Natural Language Engineering},
  year = 	 2015,
  pages = {167--200},
  volume = {21},
  number = 2,
  url = {http://dx.doi.org/10.1017/S1351324913000351}
}