Commit Messages Completion paper
Intro
Contributions
New task?
Dataset
Baselines?
Zero-shot English GPT-2
Vanilla Transformer trained on the available Commit Messages dataset
Metrics? (Actually taken from Google Compose and IntelliCode Compose)
Propose model improvements
Personalization with history
Using only changed lines (practically useful for reducing context size)
Initializing model from pretrained CodeBERT and GPT-2
Text
There is a task of generating commit messages
It seems we are still pretty far from quality that is suitable for production
Also, there are many completion systems that are useful for end users: code completion, documentation completion, and so on
We can reformulate the generation task as a completion task and gain profit from it
For completion, the quality threshold at which a system becomes useful is much lower
Main part
Task definition
Integration?
KInference
Partial server-side
Privacy concerns
Explain architecture here
Using only changed lines for context size reduction
Dataset should be multi-lingual
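The "only changed lines" idea above could be sketched as a simple filter over a unified diff: drop file headers, hunk markers, and context lines, keeping just added/removed lines. A minimal sketch (the helper name and input format are assumptions, not the paper's implementation):

```python
def changed_lines(diff_text: str) -> list[str]:
    """Keep only added/removed lines from a unified diff,
    dropping context lines, file headers, and hunk markers."""
    kept = []
    for line in diff_text.splitlines():
        if line.startswith(("+++", "---", "@@", "diff ", "index ")):
            continue  # file headers and hunk markers
        if line.startswith(("+", "-")):
            kept.append(line)  # an actually changed line
    return kept

diff = """\
diff --git a/app.py b/app.py
--- a/app.py
+++ b/app.py
@@ -1,3 +1,3 @@
 import os
-print("hello")
+print("hello, world")
"""
print(changed_lines(diff))  # ['-print("hello")', '+print("hello, world")']
```

Even this naive filter shrinks a diff considerably, since context lines usually dominate.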
Methodology and results
Metrics
Completion of commit messages is quite similar to Google Compose, so we can reuse metrics from there
Maybe add a single-token scenario and measure standard completion metrics like accuracy and MRR?
Hmm, is the PrefixMatch metric from Google Compose trying to capture the same thing?
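For the single-token scenario mentioned above, MRR is straightforward: average of 1/rank of the ground-truth token among the model's ranked candidates (0 when absent). A minimal sketch; the function name and input shapes are illustrative assumptions:

```python
def mrr(ranked_candidates: list[list[str]], targets: list[str]) -> float:
    """Mean Reciprocal Rank for single-token completion:
    1/rank of the ground-truth token, 0 if it is not suggested."""
    total = 0.0
    for cands, target in zip(ranked_candidates, targets):
        if target in cands:
            total += 1.0 / (cands.index(target) + 1)  # ranks are 1-based
    return total / len(targets)

# Hypothetical top-k predictions for two completion requests
preds = [["fix", "add", "update"], ["remove", "fix"]]
gold = ["add", "fix"]
print(mrr(preds, gold))  # 0.5: (1/2 + 1/2) / 2
```

Top-1 accuracy falls out of the same data by checking `cands[0] == target`.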
Data
Recap the dataset described above
Deduplication
How to deduplicate Commit Messages? TODO: check how they did this in Generation paper
TODO: revisit this part
Split by projects into train/val/test
In this project we used the "incomplete context" technique; not sure yet where to place this part...
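The deduplication and by-project split above could look roughly like the sketch below: exact-match dedup on normalized message text (a placeholder until the Generation paper's procedure is checked), then assigning whole projects to train/val/test so no project leaks across subsets. Function name, sample schema, and split fractions are assumptions:

```python
import hashlib
import random

def dedup_and_split(samples, seed=0, val_frac=0.1, test_frac=0.1):
    """Drop exact-duplicate messages (after whitespace/case
    normalization), then split by project so that no project
    appears in more than one subset."""
    seen, unique = set(), []
    for s in samples:
        norm = " ".join(s["message"].lower().split())
        key = hashlib.md5(norm.encode()).hexdigest()
        if key not in seen:
            seen.add(key)
            unique.append(s)

    # Shuffle projects deterministically, then carve off test/val projects
    projects = sorted({s["project"] for s in unique})
    random.Random(seed).shuffle(projects)
    n_test = max(1, int(len(projects) * test_frac))
    n_val = max(1, int(len(projects) * val_frac))
    test_p = set(projects[:n_test])
    val_p = set(projects[n_test:n_test + n_val])

    split = {"train": [], "val": [], "test": []}
    for s in unique:
        name = ("test" if s["project"] in test_p
                else "val" if s["project"] in val_p
                else "train")
        split[name].append(s)
    return split
```

Exact-match dedup will miss near-duplicates ("Fix typo." vs "Fix typo"), which is exactly why the Generation paper's procedure should be revisited.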
Can we use history to improve quality?
Some words about diversity of styles in commit messages and benefits of personalization in different tasks
Propose a method of personalization: using history of user's commit messages as part of the prompt
RQ (ver. a): can we improve quality of completion using history of user's commit messages as part of prompt?
RQ (ver. b): is personalization beneficial for quality of commit message completion?
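The history-in-the-prompt idea could be as simple as prepending the author's last few messages before the diff and the typed prefix. A minimal sketch; the separator tokens and argument names are assumptions, not a fixed format:

```python
def build_prompt(history: list[str], diff: str, prefix: str,
                 max_history: int = 3) -> str:
    """Prepend the author's most recent commit messages so the model
    can pick up their style; <msg>/<diff>/<prefix> separators are
    assumed special tokens, not a standard."""
    parts = [f"<msg> {m}" for m in history[-max_history:]]
    parts.append(f"<diff> {diff}")
    parts.append(f"<prefix> {prefix}")
    return " ".join(parts)

print(build_prompt(["Fix typo", "Add tests"], "+ new line", "Upd"))
# <msg> Fix typo <msg> Add tests <diff> + new line <prefix> Upd
```

The `max_history` cap matters because history competes with the diff for the limited context window, which ties back to the changed-lines-only idea.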
Are the pretrained models helpful?
Some words about the benefits of using pretrained models and how to use them
RQ: is using pre-trained models (CodeBERT and GPT-2) as encoder and decoder in Commit Message Completion task beneficial in terms of quality / steps to converge?
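For the RQ above, Hugging Face Transformers can wire a pretrained encoder and decoder into one seq2seq model; a sketch of how CodeBERT + GPT-2 could be combined (checkpoint choices from the outline; the cross-attention weights are freshly initialized and still need fine-tuning):

```python
from transformers import EncoderDecoderModel

def build_model():
    """Seq2seq model with CodeBERT as encoder and GPT-2 as decoder.
    Only the cross-attention layers start from random weights."""
    model = EncoderDecoderModel.from_encoder_decoder_pretrained(
        "microsoft/codebert-base", "gpt2"
    )
    # GPT-2 has no dedicated pad token, so reuse its EOS token
    model.config.decoder_start_token_id = model.config.decoder.bos_token_id
    model.config.pad_token_id = model.config.decoder.eos_token_id
    return model
```

Measuring both final quality and steps to converge (as the RQ states) against the vanilla Transformer baseline would isolate what the pretraining actually buys.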
Baselines
Describe why zero-shot with an English model is suitable here
Should we place a model trained on the old dataset here???
Vanilla seq2seq Transformer trained on our data
Dataset
Highlight differences and why the current dataset isn't suitable for the Completion task
Describe what's the difference between the Generation and the Completion tasks
Describe why the current dataset is suitable for the Generation task
Describe our dataset: how collected, stats, link
TODO: multi-lingual dataset???
Conclusion
Threats to validity
Absolute values of metrics are low
Maybe we used the wrong pre-trained models (PLBART is now available and more suitable)