Natural Language Processing with Java and LingPipe Cookbook
Table of contents (98 chapters)
- Cover
- Copyright page
- Credits
- About the Authors
- About the Reviewers
- www.PacktPub.com
- Support files, eBooks, discount offers, and more
- Preface
- What this book covers
- What you need for this book
- Who this book is for
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Simple Classifiers
- Introduction
- Deserializing and running a classifier
- Getting confidence estimates from a classifier
- Getting data from the Twitter API
- Applying a classifier to a .csv file
- Evaluation of classifiers – the confusion matrix
- Training your own language model classifier
- How to train and evaluate with cross validation
- Viewing error categories – false positives
- Understanding precision and recall
- How to serialize a LingPipe object – classifier example
- Eliminate near duplicates with the Jaccard distance
- How to classify sentiment – simple version
- Chapter 2. Finding and Working with Words
- Introduction
- Introduction to tokenizer factories – finding words in a character stream
- Combining tokenizers – lowercase tokenizer
- Combining tokenizers – stop word tokenizers
- Using Lucene/Solr tokenizers
- Using Lucene/Solr tokenizers with LingPipe
- Evaluating tokenizers with unit tests
- Modifying tokenizer factories
- Finding words for languages without white spaces
- Chapter 3. Advanced Classifiers
- Introduction
- A simple classifier
- Language model classifier with tokens
- Naïve Bayes
- Feature extractors
- Logistic regression
- Multithreaded cross validation
- Tuning parameters in logistic regression
- Customizing feature extraction
- Combining feature extractors
- Classifier-building life cycle
- Linguistic tuning
- Thresholding classifiers
- Train a little, learn a little – active learning
- Annotation
- Chapter 4. Tagging Words and Tokens
- Introduction
- Interesting phrase detection
- Foreground- or background-driven interesting phrase detection
- Hidden Markov Models (HMM) – part-of-speech
- N-best word tagging
- Confidence-based tagging
- Training word tagging
- Word-tagging evaluation
- Conditional random fields (CRF) for word/token tagging
- Modifying CRFs
- Chapter 5. Finding Spans in Text – Chunking
- Introduction
- Sentence detection
- Evaluation of sentence detection
- Tuning sentence detection
- Marking embedded chunks in a string – sentence chunk example
- Paragraph detection
- Simple noun phrases and verb phrases
- Regular expression-based chunking for NER
- Dictionary-based chunking for NER
- Translating between word tagging and chunks – BIO codec
- HMM-based NER
- Mixing the NER sources
- CRFs for chunking
- NER using CRFs with better features
- Chapter 6. String Comparison and Clustering
- Introduction
- Distance and proximity – simple edit distance
- Weighted edit distance
- The Jaccard distance
- The Tf-Idf distance
- Using edit distance and language models for spelling correction
- The case restoring corrector
- Automatic phrase completion
- Single-link and complete-link clustering using edit distance
- Latent Dirichlet allocation (LDA) for multitopic clustering
- Chapter 7. Finding Coreference Between Concepts/People
- Introduction
- Named entity coreference with a document
- Adding pronouns to coreference
- Cross-document coreference
- The John Smith problem
- Index

Updated: 2021-08-05 17:13:04