Investigating Active Learning for Short-Answer Scoring Horbach and Palmer…

- - - - Random
      - Cluster centroid
    - - Entropy Sampling
        
        Classifier confidence is computed for each item in
        the unlabeled data, and the item with the highest entropy (lowest confidence) is selected for labeling.
      - Boosted Entropy Sampling
        
        We adopt their method of boosted entropy sampling, where per-label weights are incorporated into the entropy computation to favor items more likely
        to belong to a minority class.
      - Margin Sampling
        
        This method tends to select instances that lie on the decision
        border between two classes, instead of items at the
        intersection of all classes.
      - Diversity Sampling
        
        aims to select instances that cover as much of the feature space as possible, i.e. that are as diverse as possible
      - Representativeness Sampling
        
        results in selection of items near the center of the pool
    - - (a) random seed selection
      - (b) equal seed selection
      - Number of items: In the small seed set condition, and for both random and equal selection methods, 10 individual seed sets per prompt are chosen, each with either 3 or 4 seeds (corresponding to the number of classes per prompt). We repeat this process for the large seed set condition, this time selecting 20 items per seed set.
    - - varying batch sizes...
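
The contrast between entropy sampling and margin sampling noted in the outline can be illustrated with a minimal sketch. It assumes the usual definitions (Shannon entropy over the predicted class distribution; margin as the gap between the two most probable classes). The `pool` dictionary, its item ids, and its probabilities are hypothetical illustration data, not values from the paper, and the function names are made up for this example.

```python
import math


def entropy(probs):
    """Shannon entropy (in bits) of a class-probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def margin(probs):
    """Gap between the two highest class probabilities."""
    top, second = sorted(probs, reverse=True)[:2]
    return top - second


def select_by_entropy(pool):
    """Return the id of the item whose prediction has the highest entropy."""
    return max(pool, key=lambda item_id: entropy(pool[item_id]))


def select_by_margin(pool):
    """Return the id of the item with the smallest top-two margin."""
    return min(pool, key=lambda item_id: margin(pool[item_id]))


# Hypothetical classifier output: item id -> probabilities for 3 classes.
pool = {
    "a": [0.90, 0.05, 0.05],  # confident prediction
    "b": [0.34, 0.33, 0.33],  # uncertain across all classes
    "c": [0.49, 0.49, 0.02],  # uncertain between two classes only
}

print(select_by_entropy(pool))
print(select_by_margin(pool))
```

With this toy pool, the entropy criterion picks the item that is uncertain across all classes, while the margin criterion picks the item sitting between two classes, which matches the behavior described under Margin Sampling.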