Machine Reading Comprehension (MRC) - Coggle Diagram
Machine Reading Comprehension (MRC)
1 Introduction
The MRC task
Natural Language Processing
Issue 1 The ambiguity of language
Issue 2 Common sense and reasoning skills
Deep Learning
it is used because
as model complexity increases, the parameters can still be updated efficiently
powerful feature learning ability
end-to-end learning avoids the modularity of pipeline schemes, allowing the whole model to be optimized jointly
hardware used for DL (GPU) is constantly being upgraded
community frameworks (Keras, TensorFlow, PyTorch) make it easy to use
Achievements
BERT 2018
machine translation reached the same level of accuracy as human translators in 2018
issues: some DL models are 'black boxes'; no one knows how the input and output are related
Evaluation of MRC
evaluated like human reading comprehension: question answering and the quality of the answers
Answer forms
metrics
F1 score
ROUGE
BLEU
Exact Match
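Exact Match and token-level F1 can be sketched in a few lines of Python. This is a simplified version: the official SQuAD evaluation also strips punctuation and articles ("a", "an", "the") before comparing.

```python
from collections import Counter

def normalize(text):
    # Simplified normalization: lowercase and split on whitespace.
    return text.lower().split()

def exact_match(prediction, reference):
    # 1 if the normalized answers are identical, else 0.
    return int(normalize(prediction) == normalize(reference))

def f1_score(prediction, reference):
    # Token-level F1: harmonic mean of precision and recall
    # over the multiset of tokens shared by both answers.
    pred, ref = normalize(prediction), normalize(reference)
    common = Counter(pred) & Counter(ref)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)

print(exact_match("the cat", "The cat"))                 # → 1
print(round(f1_score("the black cat", "the cat"), 3))    # → 0.8
```

F1 gives partial credit when the predicted span overlaps the reference, which is why it is reported alongside the stricter Exact Match.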
MRC datasets
Single paragraph datasets
RACE
CNN/DailyMail
NewsQA
SQuAD
Most influential dataset
CoQA
multi-turn conversational question answering, 2018
Multi Paragraph datasets
MS Marco
searches Bing, retrieves the paragraphs, and answers questions, 2016
DuReader
extracts full web pages via Baidu search and answers questions, 2017
QAngaroo
gets answers from multiple paragraphs (Wikipedia and PubMed), 2017
HotpotQA
similar to QAngaroo, 2018
promotes reasoning rather than text matching
Corpus-based datasets
AKA open-domain MRC, because the large text input is not limited to a single topic
AI2 Reasoning Challenge (ARC)
answers multiple-choice science test questions, 2018
How to make your MRC dataset?
Generation of articles and datasets
3 key components: articles, questions, and answers
Generating questions from articles
Given a paragraph, labelers manually create questions
Generating articles from questions
your Google search history, and the result that suited you, may be used to train an MRC model
Generation of correct answers
Labelers have subjective biases, so 100% accurate ground-truth answers are impossible :warning:
How to build a high quality MRC dataset :question:
high quality means measuring how close a model's reading ability is to that of humans
Distinguish comprehension-based and matching-based models
as a standard, make about 35% of questions unanswerable, with passage spans that superficially match the question but are not semantically valid answers
Evaluate the reasoning capability
e.g., test induction over multiple pieces of evidence
Assess common sense
the weakest point of NLP and machine learning
Other comprehension skills
List | enumeration
identify, summarize, and sequentially output related concepts
Mathematical operations
Logical Reasoning
Coreference resolution
resolve pronouns to their referents in order to answer the question
Analogy
Spatial-Temporal Relations
Common sense reasoning
Schematic | rhetorical clause relations
Special sentence structure
Causal relations
2 The basics of NLP
Tokenization
Byte-pair Encoding
usually applied when model needs to generate text
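A minimal sketch of how BPE learns its merge rules, assuming a toy frequency vocabulary in which words are pre-split into characters with an end-of-word marker:

```python
from collections import Counter

def get_pair_counts(vocab):
    # Count adjacent symbol pairs, weighted by word frequency.
    pairs = Counter()
    for word, freq in vocab.items():
        symbols = word.split()
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs

def merge_pair(pair, vocab):
    # Replace every occurrence of the pair with its concatenation.
    a, b = pair
    return {word.replace(f"{a} {b}", f"{a}{b}"): freq
            for word, freq in vocab.items()}

# Toy corpus counts; real BPE runs thousands of merges.
vocab = {"l o w </w>": 5, "l o w e r </w>": 2, "n e w e s t </w>": 6}
for _ in range(3):
    pairs = get_pair_counts(vocab)
    best = max(pairs, key=pairs.get)
    vocab = merge_pair(best, vocab)
    print(best)
```

Each iteration merges the most frequent adjacent pair into a single subword unit; the learned merge list is then replayed to tokenize unseen text.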
The cornerstone of NLP: word vectors
Word vectorization
One-hot-embedding
Distributed Representation
Word2Vec
Skip-gram
implementation details
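One such implementation detail is how skip-gram builds its (center, context) training pairs; a minimal sketch, assuming a fixed symmetric window:

```python
def skipgram_pairs(tokens, window=2):
    # For each center word, emit a (center, context) pair for every
    # word within `window` positions on either side.
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs

sentence = "the quick brown fox jumps".split()
print(skipgram_pairs(sentence, window=1))
```

The model is then trained to predict each context word from its center word, typically with negative sampling rather than a full softmax over the vocabulary.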
Language models
N-gram model
Evaluation of language models
the standard metric is perplexity
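Perplexity for a bigram model can be sketched as follows; the toy corpus and add-alpha smoothing are illustrative assumptions:

```python
import math
from collections import Counter

def train_bigram(corpus, alpha=1.0):
    # Bigram probabilities with add-alpha smoothing:
    # P(w | prev) = (count(prev, w) + alpha) / (count(prev) + alpha * |V|)
    unigrams = Counter(corpus)
    bigrams = Counter(zip(corpus, corpus[1:]))
    vocab_size = len(set(corpus))
    def prob(prev, word):
        return (bigrams[(prev, word)] + alpha) / (unigrams[prev] + alpha * vocab_size)
    return prob

def perplexity(prob, tokens):
    # Exponential of the average negative log-probability
    # per predicted token: lower is better.
    log_sum = sum(math.log(prob(p, w)) for p, w in zip(tokens, tokens[1:]))
    return math.exp(-log_sum / (len(tokens) - 1))

corpus = "the cat sat on the mat the cat ate".split()
prob = train_bigram(corpus)
print(perplexity(prob, "the cat sat".split()))
```

A perplexity of k roughly means the model is as uncertain as a uniform choice among k words at each step.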
Linguistic Tagging
Named Entity Recognition
(determine whether a word belongs to an entity category, e.g., person, location, organization)
Rule-based named entity recognition
Feature-based named entity recognition
NER based on deep learning
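The rule-based approach above can be sketched with a toy gazetteer plus a capitalization rule; the word lists and rule here are illustrative assumptions, not a real system:

```python
import re

# Hypothetical toy gazetteer; real systems use large curated lists.
GAZETTEER = {"london": "LOC", "paris": "LOC", "google": "ORG"}

def rule_based_ner(text):
    # Rule: a token is an entity if it appears in the gazetteer
    # and is capitalized in the running text.
    entities = []
    for token in re.findall(r"[A-Za-z]+", text):
        label = GAZETTEER.get(token.lower())
        if label and token[0].isupper():
            entities.append((token, label))
    return entities

print(rule_based_ner("Google opened an office in London."))
# → [('Google', 'ORG'), ('London', 'LOC')]
```

Rules like these are precise but brittle; feature-based and deep learning approaches generalize to names the gazetteer has never seen.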
Part-Of-Speech (POS) Tagging
To know if a word is a noun, adverb, verb, etc.
Estimate probabilities in a hidden Markov model (HMM)
Maximize the tag-sequence probability in the HMM (Viterbi algorithm)
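Maximizing the tag-sequence probability is typically done with the Viterbi algorithm; a minimal sketch, using a hypothetical two-tag (NOUN/VERB) HMM with made-up probabilities:

```python
def viterbi(obs, states, start_p, trans_p, emit_p):
    # Dynamic programming over the trellis: V[t][s] holds the
    # probability of the best state path ending in state s at time t,
    # together with that path.
    V = [{s: (start_p[s] * emit_p[s].get(obs[0], 0.0), [s]) for s in states}]
    for t in range(1, len(obs)):
        layer = {}
        for s in states:
            prob, path = max(
                (V[t - 1][prev][0] * trans_p[prev][s] * emit_p[s].get(obs[t], 0.0),
                 V[t - 1][prev][1] + [s])
                for prev in states
            )
            layer[s] = (prob, path)
        V.append(layer)
    return max(V[-1].values())

# Hypothetical transition and emission tables for illustration only.
states = ["NOUN", "VERB"]
start_p = {"NOUN": 0.7, "VERB": 0.3}
trans_p = {"NOUN": {"NOUN": 0.3, "VERB": 0.7},
           "VERB": {"NOUN": 0.8, "VERB": 0.2}}
emit_p = {"NOUN": {"dogs": 0.6, "bark": 0.1},
          "VERB": {"dogs": 0.1, "bark": 0.7}}
prob, path = viterbi(["dogs", "bark"], states, start_p, trans_p, emit_p)
print(path)  # → ['NOUN', 'VERB']
```

Viterbi finds the globally best tag sequence in O(T x S^2) time instead of enumerating all S^T sequences.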
Named entity recognition and part-of-speech tagging in Python
3 Deep Learning in NLP