
|
Subscribe to our newsletter and you'll continuously
be informed about our new books, series and journals.
You can customize this e-mail newsletter to your particular
needs and interests.
Newsletters include special discount codes.
To subscribe or change existing preferences press the subscribe button
|
|
|

| Series and Journals |
|
 |
A resource-light approach to morpho-syntactic tagging.
Feldman, Anna and Jirka Hana
Amsterdam/New York, NY, 2010, XIV, 185 pp.
|
Series: Language and Computers - Studies in Practical Linguistics 70
While supervised corpus-based methods are highly accurate for different NLP tasks, including morphological tagging, they are difficult to port to other languages because they require resources that are expensive to create. As a result, many languages have no realistic prospect for morpho-syntactic annotation in the foreseeable future. The method presented in this book aims to overcome this problem by significantly limiting the necessary data and instead extrapolating the relevant information from another, related language. The approach has been tested on Catalan, Portuguese, and Russian. Although these languages are only relatively resource-poor, the same method can be in principle applied to any inflected language, as long as there is an annotated corpus of a related language available. Time needed for adjusting the system to a new language constitutes a fraction of the time needed for systems with extensive, manually created resources: days instead of years. This book touches upon a number of topics: typology, morphology, corpus linguistics, contrastive linguistics, linguistic annotation, computational linguistics and Natural Language Processing (NLP). Researchers and students who are interested in these scientific areas as well as in cross-lingual studies and applications will greatly benefit from this work. Scholars and practitioners in computer science and linguistics are the prospective readers of this book.
Contents List of tables List of figures Preface Introduction Common tagging techniques Previous resource-light approaches to NLP Languages, corpora and tagsets Quantifying language properties Resource-light morphological analysis Cross-language morphological tagging Summary and further work Bibliography Appendices: Tagsets we use; Corpora; Language properties Citation Index
Anna Feldman is an assistant professor of linguistics and computer science at Montclair State University. She received her Ph.D. from The Ohio State University. For more information on her research, please, visit http://purl.org/net/fa
Jirka Hana is a researcher at Charles University in Prague. He holds a Ph.D. degree in linguistics from The Ohio State University and a doctoral degree in computer science from Charles University. He has published numerous articles in computational linguistics. For more information, please,visit http://purl.org/net/jh
|
|
|


|
Tijnmuiden 7
1046 AK Amsterdam
The Netherlands
T: +31-20-611 48 21
F: +31-20-447 29 79
248 East 44th Street - 2nd floor
New York, NY 10017
USA
T: 1-800-225-3998
F: 1-800-853-3881
Toll-free in the USA
info@rodopi.nl
|
|
|