Create your own natural language training corpus for machine learning. This example-driven book walks you through the annotation cycle, from selecting an annotation task and creating the annotation specification to designing the guidelines, creating a "gold standard" corpus, and then beginning the actual data creation with the annotation process. Systems exist for analyzing existing corpora, but making a new corpus can be extremely complex. To help you build a foundation for your own machine learning goals, this easy-to-use guide includes case studies that demonstrate four different annotation tasks in detail. You'll also learn how to use a lightweight software package for annotating texts and adjudicating the annotations. This book is a perfect companion to O'Reilly's Natural Language Processing with Python, which describes how to use existing corpora with the Natural Language Toolkit.
Product details
- Paperback | 350 pages
- 179.58 x 228.85 x 18.54mm | 548.85g
- 20 Nov 2012
- O'Reilly Media, Inc, USA
- Sebastopol, United States
- English
- Annotated
- annotated ed
- 1449306667
- 9781449306663
- 586,897
Download Natural Language Annotation for Machine Learning (9781449306663).pdf, available at www.thebookosaur.com for free.
0 Comments