Authors:
Hilton Alers-Valentín
1
;
Carlos G. Rivera-Velázquez
2
;
J. Fernando Vega-Riveros
2
and
Nayda G. Santiago
2
Affiliations:
1
Linguistics and Cognitive Science Program, Department of Hispanic Studies, University of Puerto Rico-Mayagüez, P.O. Box 6000, Mayagüez, 00681-6000 and Puerto Rico
;
2
Department of Electrical and Computer Engineering, University of Puerto Rico-Mayagüez, P.O. Box 6000, Mayagüez, 00681-6000 and Puerto Rico
Keyword(s):
Syntax, Parser, Lexicon, Structural Ambiguity, Computational Linguistics, Natural Language Processing.
Abstract:
This paper presents the current status of a research project in computational linguistics/natural language processing whose main objective is to develop a symbolic, principle-based, bottom-up system in order to process and parse sequences of lexical items as declarative sentences in English. For each input sequence, the parser should produce (maximally) binary trees as generated by the Merge operation on lexical items. Due to parametric variations in the algorithm, the parser should be able to output (up to four) grammatically feasible structural representations accounted by alternative constituent analyses because of structural ambiguities in the parsing of the input string. Finally, the system should be able to state whether a particular string of lexical items is a possible sentence in account of its parsability. The system has a scalable software framework that may be suitable for the analysis of typologically-diverse natural languages.