Please enable JavaScript.
Coggle requires JavaScript to display documents.
Natural Language Processing(NLP) - Coggle Diagram
Natural Language Processing(NLP)
Definition
Natural language processing (NLP) is the ability of a computer program to understand human language as it's spoken and written referred to as natural language. It's a component of artificial intelligence (AI).
Interactions between computers and human languages.
Reference
Formal language:
https://www.touro.edu/departments/writing-center/tutorials/formal-vs-informal-language/
Linguistics:
https://arts-sciences.buffalo.edu/linguistics/about/what-is-linguistics.html
language:
https://study.com/learn/lesson/language-meaning-concept.html
Sentiment Analysis:
https://aws.amazon.com/what-is/sentiment-analysis/#
:
Cognitive Psychology:
https://www.simplypsychology.org/cognitive.html
Natural language processing:
https://www.techtarget.com/searchenterpriseai/definition/natural-language-processing-NLP#
:
Machine Learning:
https://www.geeksforgeeks.org/machine-learning/
N-gram:
https://botpenguin.com/glossary/n-gram
co-reference resolution:
https://www.netguru.com/glossary/coreference-resolution
Machine translation:
https://aws.amazon.com/what-is/machine-translation/
History of NLP
1950s
Turing “computer machinery & intelligence” test
1954
Georgetown experiment - auto translation from Russian to English
1960s
Eliza - (Weizenbaum) psychotherapist
Restricted vocabulary set
Works in “blocks world”
1970s
chatterbots
Programs structure real-world information into computer-understandable data
Conceptual ontologies
1980s
Machine learning algorithms
Advantages
Applications
Challenging areas
Domain-specific language
Ambiguity Resolution
Contextual Understanding
Real-time NLP and Scalability
Sarcasm and Irony Detection
Common sense (understanding the world)
Replicate human thinking
Logical variations
Sentiment Analysis
Sentiment analysis is the process of analyzing digital text to determine if the emotional tone of the message is positive, negative, or neutral.
Sentiment analysis is the task of classifying the polarity of a given text.
language
language is a term for any complex communication system used by humans that consists of words & phrases that, when combined, can create infinite variable utterances.
Conversation
Collaboration
Communication
Co-creation
Formal Vs Natural language
Formal language
EX.
Python, Java, C
Semantics (meaning)
It refers to the meaning associated with the statement in a programming language.
Rules (grammar)
It refers to the rules and regulations for writing any statement in a programming language like C/C++.
Formal language is less personal than informal language.
Human language
No single meaning, but probability distribution
Large and constantly changing
Ambiguity “he saw her duck”
Human language is unique among all forms of animal communication.
Need to Understand
Computer Science
Concerned with developing internal representations of data and efficient processing of these structures.
Cognitive Psychology
Cognitive psychology studies mental processes, including how people perceive, think, remember, learn, solve problems, and make decisions.
Linguistics
Morphology
Root
Prefix
Suffix
Syntax
Pragmatics
Semantics
Lexicon
Phonetics
Phonetics Chart
Phonology
Discourse Analysis
Aim
Classify speech acts
Linguistics is the scientific study of language, and its focus is the systematic investigation of the properties of particular languages as well as the characteristics of language in general.
Machine learning
Machine learning is a branch of Artificial Intelligence that focuses on developing models & algorithms that let computers learn from data without being explicitly programmed for every task.
Advantages
Auto-focus on most common cases
Use stats to deal with unfamiliar input
Improves with more data, not more programming
Applications - clever searches
Challenges
How can PC derive meaning from input
Linguistic
Good progress
Sentiment analysis
Translation
Big Data
Sarcasm
Really hard
Paraphrase
Summarise-xml markup
Dialogue
Mostly solved
POS
English ok, Chinese tonal language mighty difficult Using Hidden Markov Models to figure probability of sequences
Spelling correction
NER
Named entity recognition
Language identification
Spelling correction
Language identification
OCR
Open character recognition Scanning, passwords
Natural language understanding
OCR
N-gram character model
N-Gram is simply a sequence of N words, where N can be any positive integer.
Markov chains
A probabilistic model that predicts the next state (e.g., word) based on the current state and the transition probabilities learned from the corpus.
Prediction on corpus of text
Co-reference resolution
Coreference resolution is a critical aspect of natural language processing (NLP), a subfield of artificial intelligence (AI).
Good progress
Parsing - grammatical analysis of sentence - ambiguous
IE - Google calendar
Genre classification
Crash blossoms
Machine translation
Machine translation is the process of using artificial intelligence to automatically translate text from one language to another without human involvement.
Word knowledge
Common Sense - logic
Logical variations
Involves a variety of domains
A little space and physics
Perception
Time
Naive psychology
Multi-agents
Knowledge
Replicate human thinking
Reason with knowledge that is true by default
Reason rapidly across a broad range of domains
Tolerate uncertainty in your knowledge
Differences between formal and natural languages
Natural Languages: These are languages that have developed organically through use by humans for everyday communication
Formal Languages: These are constructed languages with a specific set of rules and symbols, often used in mathematical logic, computer science, and linguistics
Types of words
Lexicon
Understanding everything about distinct words according to their position in the speech, their meanings and their relation to other words.
Types
printed
specialized
Mental
French Idioms
Tomber dans les pommes = faint / pass out
Avoir un poil dans la main = to be lazy
Être une bonne poire = to be naive
Faire le pont = have a long weekend
Ce ne sont pas vos oignons = none of your business
Lexicon of comics
Lewis Carroll
Syntax
Analyzing the words of a sentence
to uncover the grammatical structure
Pragmatics
Explains how extra meaning is read into texts without actually being encoded in them.
Semantics
The possible meanings of a sentence by focusing on the interactions among word-level meanings in the sentence
Ontologies
The Semantic Web is an extension of the existing World Wide Web.
It provides a standardized way of expressing the relationships between web pages, to allow machines to understand the meaning of hyperlinked information