Efficient Parsing for Natural Language
Author | : Masaru Tomita |
Publisher | : Springer Science & Business Media |
Total Pages | : 209 |
Release | : 2013-04-17 |
ISBN-10 | : 9781475718850 |
ISBN-13 | : 1475718853 |
Rating | : 4/5 (50 Downloads) |
Download or read book Efficient Parsing for Natural Language written by Masaru Tomita and published by Springer Science & Business Media. This book was released on 2013-04-17 with total page 209 pages. Available in PDF, EPUB and Kindle. Book excerpt: Parsing Efficiency is crucial when building practical natural language systems. 'Ibis is especially the case for interactive systems such as natural language database access, interfaces to expert systems and interactive machine translation. Despite its importance, parsing efficiency has received little attention in the area of natural language processing. In the areas of compiler design and theoretical computer science, on the other hand, parsing algorithms 3 have been evaluated primarily in terms of the theoretical worst case analysis (e.g. lXn», and very few practical comparisons have been made. This book introduces a context-free parsing algorithm that parses natural language more efficiently than any other existing parsing algorithms in practice. Its feasibility for use in practical systems is being proven in its application to Japanese language interface at Carnegie Group Inc., and to the continuous speech recognition project at Carnegie-Mellon University. This work was done while I was pursuing a Ph.D degree at Carnegie-Mellon University. My advisers, Herb Simon and Jaime Carbonell, deserve many thanks for their unfailing support, advice and encouragement during my graduate studies. I would like to thank Phil Hayes and Ralph Grishman for their helpful comments and criticism that in many ways improved the quality of this book. I wish also to thank Steven Brooks for insightful comments on theoretical aspects of the book (chapter 4, appendices A, B and C), and Rich Thomason for improving the linguistic part of tile book (the very beginning of section 1.1).