Please enable JavaScript.

Coggle requires JavaScript to display documents.

Accents and Languages Planning (Accents (infrastructure (pronunciation…

- - - - common voice
      - scottish/english DB existing
      - combilex lexicon
    - - limited voices
    - - unsupervised pre-training followed by supervised training
      - get more transcribed
        
        need more volume
        
        100 hours per accent (too low for specific accent models)
        
        200+ for good models
        
        send more through pipegood?
        
        Transcriber team not great at identifying or transcribing accents
    - - Simon briefly experimented with this and it didn't work
  - - - experiment
    - - LM more important with specific models
    - - Simon says we can't do FMLR in realtime, need to know end of sentence, speaker info, etc.
      - auto encoder to remove accent "noise"
    - - map to british pronunciations
      - different phoneme set
      - need to re-map OOVs
    - - accented samples and do training
- - - - local based for V1
      - User setting
  - - - new punctuator
      - new training data
      - subword units
      - audio data
      - lexicon/lexicographers
      - transcribers
      - new textproc, metrics
      - we have to support it going forward
      - DATA!