Please enable JavaScript.
Coggle requires JavaScript to display documents.
Text and Web Analytics Week 8 (Challenges of Text Mining (NLP) (Part-of…
Text and Web Analytics
Week 8
Challenges of Text Mining (NLP)
Part-of-speech tagging: depends not only on definition of
term but also on the context used
Speech acts and semantic analysis: understanding the
meaning of words
Text segmentation : eg analysis of free-form text found in
e-mails and recorded telephone transcripts
Text contains acronyms, abbreviations, misspellings. e.g.
customer, cust, customar, csmr
Imperfect or irregular input eg foreign accents
Web Mining
Web is the largest repository of data
Challenges
The Web is too big for effective data mining
The Web is too complex
The Web is too dynamic
The Web is not specific to a domain
The Web has everything
Web mining is the process of discovering intrinsic
relationships from Web data
Sentiment Analysis
gets data from full set of customer touch points
VOC is a key element of customer experience
management initiates
Voice of the market (VOM) : understanding
aggregate opinions and trends.
Web Usage Mining (Web Analytics)
Text Mining Application Area
Clustering: group similar documents together
Summarization: to save time for the reader
Information extraction: identification of key phrases and relationships within text by looking for patterns