Author: Ashish Kumar;Avinash Paul
Publisher: Packt Publishing
Publication year: 2016
E-ISBN: 9781782174707
P-ISBN(Paperback): 9781783551811
Subject: TP39 computer application
Keyword: 计算机的应用,自动化技术、计算机技术,计算技术、计算机技术
Language: ENG
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Description
Master text-taming techniques and build effective text-processing applications with R About This Book • Develop all the relevant skills for building text-mining apps with R with this easy-to-follow guide • Gain in-depth understanding of the text mining process with lucid implementation in the R language • Example-rich guide that lets you gain high-quality information from text data Who This Book Is For If you are an R programmer, analyst, or data scientist who wants to gain experience in performing text data mining and analytics with R, then this book is for you. Exposure to working with statistical methods and language processing would be helpful. What You Will Learn • Get acquainted with some of the highly efficient R packages such as OpenNLP and RWeka to perform various steps in the text mining process • Access and manipulate data from different sources such as JSON and HTTP • Process text using regular expressions • Get to know the different approaches of tagging texts, such as POS tagging, to get started with text analysis • Explore different dimensionality reduction techniques, such as Principal Component Analysis (PCA), and understand its implementation in R • Discover the underlying themes or topics that are present in an unstructured collection of documents, using common topic models such as Latent Dirichlet Allocation (LDA) • Build a baseline sentence completing application • Perform entity extraction and named entity recognition using R In Detail Text Mini
Chapter