Schedule
Slides
Videos
Opinionated…
Framework
Profession and Education
Roles for Computing in Social Change (Harvard, Cornell) Positing that computational research has valuable roles to play in addressing social problems
- Computing as diagnostic
Computing can help us measure social problems and diagnose how they manifest in technical systems.
- Computing as formalizer
Computing shapes how social problems are explicitly defined — changing how those problems, and possible responses to them, are understood.
- Computing as rebuttal
Computing can clarify the limits of technical interventions, and of policies premised on them.
- Computing as synecdoche
Computing can foreground long-standing social problems in a new way.
Fairness
Counterfactual
:fountain_pen: FlipTest: Fairness Testing via Optimal Transport (CMU)
- Naively flipping the protected feature in the data for assessment doesn't work (e.g., it produces out-of-distribution samples).
- Counterfactual fairness relies on a causal model - but then we need a causal model!
- Idea: optimal transport - map one group's distribution (e.g., men) onto the other's (e.g., women) by matching each data point to a counterpart.
- Query the model on the matched pairs; the pairs whose predictions differ form the flipset - ideally there is no difference between the two.
- They use GANs to approximate the mapping.
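A minimal sketch of the evaluation step, assuming the optimal-transport mapping (which the paper approximates with a GAN) has already been computed and the classifier exposes an sklearn-style `.predict`:

```python
import numpy as np

def flipset(model, X_group_a, mapped_to_b):
    """Sketch of a FlipTest-style check: compare predictions on each point
    from one group and its optimal-transport counterpart in the other group.

    X_group_a   : (n, d) data points from one group
    mapped_to_b : (n, d) their mapped counterparts (assumed precomputed)
    Returns the indices whose prediction flips, i.e. the flipset.
    """
    preds_a = model.predict(X_group_a)
    preds_b = model.predict(mapped_to_b)
    flips = np.where(preds_a != preds_b)[0]
    print(f"{len(flips)} of {len(preds_a)} matched pairs change prediction")
    return flips
```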
:fountain_pen: Counterfactual risk assessments, evaluation, and fairness (CMU)
- Problem: Most algorithmic risk assessments are trained and evaluated on historical data in which the observed outcomes depend on the historical decision-making policy.
- Approach: Counterfactual risk modeling and evaluation that properly account for these intervention effects without building a structural causal model (SCM).
- Technique for computing the metrics: doubly-robust (DR) counterfactual evaluation (a generic sketch follows the case study below).
- Regarding fairness:
- Counterfactual formulations of three standard fairness metrics that are more appropriate for decision-making settings.
- Theoretical results showing that only under strong conditions, which are unlikely to hold in general, does fairness according to standard metrics imply fairness according to counterfactual metrics.
- Empirically, applying existing fairness-corrective methods can increase disparity in the counterfactual redefinition of the metric they target.
Case study: Child welfare screening
- The outcome is re-referral within a six-month period.
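For reference, the generic doubly-robust estimator of a counterfactual outcome mean that such evaluation builds on (standard textbook form, not the authors' code; `mu_hat` and `pi_hat` are assumed to be pre-fit outcome and propensity models):

```python
import numpy as np

def doubly_robust_mean(y, a, x, mu_hat, pi_hat, target_a=0):
    """Doubly-robust estimate of E[Y(target_a)]: the mean outcome had everyone
    received decision `target_a`, corrected for the historical decision policy.

    y, a, x : observed outcomes, historical decisions, covariates
    mu_hat  : fitted outcome model, mu_hat(x) ~ E[Y | X=x, A=target_a]
    pi_hat  : fitted propensity model, pi_hat(x) ~ P(A=target_a | X=x)
    """
    mu = mu_hat(x)                                # outcome-model prediction
    pi = np.clip(pi_hat(x), 1e-3, 1.0)            # clip to avoid huge weights
    correction = (a == target_a) / pi * (y - mu)  # inverse-propensity residual
    return float(np.mean(mu + correction))
```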
Economics
:fountain_pen: Fair classification and social welfare (Harvard)
- How do leading notions of fairness as defined by computer scientists map onto longer-standing notions of social welfare?
- Welfare is defined by the number of positively labeled individuals achievable for each social group.
- The Pareto principle requires that when all individuals strictly prefer alternative x to alternative y, so does the social planner.
- ERM with proxy fairness constraint: ε-fair Soft SVM.
- We prove that stricter fairness standards do not necessarily support welfare-enhancing outcomes for the disadvantaged group.
- In many such cases, the learning goal of ensuring group-based fairness is incompatible with the Pareto Principle.
- Asking that an algorithmic procedure abide by a more stringent fairness criterion can lead to classification schemes that actually make every stakeholder group worse off!
- Efficient algorithm to find the welfare value for each ε (a toy ε-fair soft SVM is sketched after this list).
- Approach: determine when a change in ε causes a change in the classification outcome of individuals.
- Experiment on the Adult dataset.
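A minimal sketch of what an ε-fair soft SVM could look like as constrained ERM (the mean-score-gap constraint below is a common proxy and an assumption on my part, not necessarily the paper's exact formulation); sweeping ε traces out the welfare trade-off described above:

```python
import cvxpy as cp
import numpy as np

def eps_fair_soft_svm(X, y, group, eps, C=1.0):
    """Soft-margin SVM with a proxy fairness constraint: the two groups' mean
    signed distances to the decision boundary may differ by at most eps.

    X: (n, d) features; y: (n,) labels in {-1, +1}; group: (n,) in {0, 1}.
    """
    n, d = X.shape
    w, b = cp.Variable(d), cp.Variable()
    xi = cp.Variable(n, nonneg=True)                 # hinge-loss slack variables
    scores = X @ w + b
    idx0, idx1 = np.where(group == 0)[0], np.where(group == 1)[0]
    gap = cp.sum(scores[idx0]) / len(idx0) - cp.sum(scores[idx1]) / len(idx1)
    problem = cp.Problem(
        cp.Minimize(0.5 * cp.sum_squares(w) + C * cp.sum(xi)),
        [cp.multiply(y, scores) >= 1 - xi,           # soft-margin constraints
         cp.abs(gap) <= eps])                        # epsilon-fairness proxy
    problem.solve()
    return w.value, b.value
```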
:fountain_pen: Fairness and utilization in allocating resources with uncertain demand (Cornell)
Task: Resource allocation when the demands for the resource are distributed across multiple groups and drawn from (known) probability distributions.
Natural fairness requirement: Individuals from different groups should have (approximately) equal probabilities of receiving the resource.
Metrics:
- Utilization = the expected number of resources that get used
- Probability gap = the maximal difference (across individuals) in the probability of obtaining the resource, given that the individual needs it
Utilization ratio (\(UR(\alpha)\)) is the ratio of the utilizations obtained: \(\frac{\text{Max utilization}}{\text{Max utilization with an }\alpha\text{-gap}}\). Also called the Price of Fairness. (A Monte-Carlo sketch of both metrics follows the results below.)
Core results
In the worst case, utilization and probability gap objectives can be completely opposed. But for many commonly-seen distributions, allocations that optimize for one objective also do very well on the other.
- Discrete allocation → For \(\alpha < 1\), \(UR(\alpha)\) is unbounded.
- Continuous/probabilistic allocation → For \(\alpha > 0\), \(UR(\alpha)\) upper bounded by \(\frac{1}{\alpha}\), but \(UR(\alpha)\) unbounded for \(\alpha = 0\).
- A specific family (including exponential, Weibull) → Max utilization and probability gap \(\alpha = 0\) can be achieved simultaneously: \(UR(\alpha) = 1\).
- Power law distributions → Goals are closely aligned: Given a fixed number of groups, \(UR(\alpha)\) is bounded by small constant independent of distribution parameters.
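A Monte-Carlo sketch of the two metrics for a fixed per-group allocation (the paper treats these analytically; the function, the uniform within-group assignment, and the Poisson example are illustrative assumptions):

```python
import numpy as np

def allocation_metrics(allocation, demand_samplers, n_trials=10000, seed=0):
    """Estimate expected utilization and probability gap for a fixed allocation,
    assuming resources within a group go uniformly at random to those in need.

    allocation      : resources reserved per group, e.g. [4, 2]
    demand_samplers : one callable per group returning an integer demand draw
    """
    rng = np.random.default_rng(seed)
    n_groups = len(allocation)
    used_total = 0.0
    prob_sum = np.zeros(n_groups)        # running sum of P(served | in need)
    trials_with_need = np.zeros(n_groups)
    for _ in range(n_trials):
        for g in range(n_groups):
            demand = demand_samplers[g](rng)
            served = min(allocation[g], demand)
            used_total += served
            if demand > 0:
                prob_sum[g] += served / demand
                trials_with_need[g] += 1
    utilization = used_total / n_trials
    p_served = prob_sum / np.maximum(trials_with_need, 1)
    return utilization, p_served.max() - p_served.min()

# Illustrative example with Poisson demands of different means:
# util, gap = allocation_metrics([4, 2],
#     [lambda r: r.poisson(3.0), lambda r: r.poisson(1.5)])
```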
:fountain_pen: The effects of competition and regulation on error inequality in data-driven markets (UPenn, MSR)
Economic incentives that can drive unfairness
- Consider a setting in which firms use data to provide a product/service to consumers
- Consumers all benefit from increased accuracy - think speech recognition, search (NOT loans, insurance, etc.)
- Unfairness formalized as gap in error rates across groups
Takeaways
- Economic incentives may affect fairness outcomes in real-world data-driven markets
- It may not be enough to identify technical sources of unfairness and build algorithmic fairness tools; incentives may need to change!
Dynamics (Impact)
:fountain_pen: Fairness Is Not Static: Deeper Understanding of Long Term Fairness via Simulation Studies (Google)
- Using an MDP framework to run simulations of the impact of deployed models (without a reward); a toy agent/environment loop is sketched at the end of this paper's notes
- Agent (Algorithm)
- Environment (Society)
- Scenario I: Binary classifier (lending)
- Agents
- Results
- Diverging narratives from one-step (analytical) analysis
- EO (equality of opportunity) agents and aggregate TPR
- Scenario II: Attention allocation
- Agents
- Uniform
- Proportional
- Greedy
- Exploration
- Metric: the maximal gap in empirical discovery probabilities between all pairs of sites
- Results
- Effectiveness - with dynamics, maximizing hits is not the same as minimizing misses
- Fairness - agents behave differently
- Scenario III: Strategic manipulation (college admission)
- Individuals are able to pay a cost to manipulate their features (and increase their score)
- Applicants are aware of the agent decision rule
- Metric: Social burden
- Agent
- Static (one-shot)
- Robust
- Continuous
- Results
- Continuous retraining compensates for strategic manipulation
- Stronger with noise in the score-label relationship
- Trade-off: agent utility vs. individual utility
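A toy agent/environment loop in the spirit of such simulations (in the spirit of Google's ml-fairness-gym but not its actual API; the lending dynamics and the threshold policy are made-up illustrations):

```python
import numpy as np

class LendingEnv:
    """Toy environment: two groups with different repayment probabilities that
    drift depending on past lending decisions (hypothetical dynamics)."""
    def __init__(self, repay_prob=(0.8, 0.6), drift=0.01, seed=0):
        self.repay_prob = list(repay_prob)
        self.drift = drift
        self.rng = np.random.default_rng(seed)

    def step(self, lend, group):
        repaid = lend and self.rng.random() < self.repay_prob[group]
        if lend:  # successful loans slowly improve the group's repayment rate
            self.repay_prob[group] = min(1.0, self.repay_prob[group]
                                         + self.drift * (1 if repaid else -1))
        return repaid

def run_episode(env, policy, steps=1000):
    """Fraction of encounters per group that end in a repaid loan
    (a crude group-outcome proxy for comparing agents over time)."""
    outcomes = {0: [], 1: []}
    for _ in range(steps):
        group = int(env.rng.integers(2))
        lend = policy(group, env.repay_prob)   # agent's decision rule
        outcomes[group].append(env.step(lend, group))
    return {g: float(np.mean(o)) if o else 0.0 for g, o in outcomes.items()}

# Example agent: lend whenever the group's repayment probability exceeds 0.7.
# run_episode(LendingEnv(), lambda g, p: p[g] > 0.7)
```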
:fountain_pen: The disparate equilibria of algorithmic decision making when individuals invest rationally (UC Berkeley, MSR, Cornell)
Analyzing feedback loops in classification (e.g., hiring, university admission)
Model
- Individual’s Rational Response
- Invest a cost to acquire qualifications (\(Y=1\)).
- It depends on the qualification assessment rule currently implemented by the institution.
- In any group, the cost is distributed randomly.
- Get payoff if assessed to be qualified.
- Institution’s Rational Response
- Choose a qualification assessment parameter for accepting individuals to maximize its utility
- Gain (TP) and cost (FP)
- Maximize expected utility (can be expressed as the rate of qualification in each group)
- Have infinitely many samples from the underlying distributions
Dynamics (a generic best-response sketch follows at the end of these notes)
What are the equilibria? Metrics of interest:
- Stability
- Rate of qualification in each group
- The balance between rates of qualification
- Institutional utility
Result:
- If there exists a zero-error hiring policy in the model class, there is a unique (non-trivial) equilibrium
- All groups have the same qualification rate at equilibrium. This is also the optimal qualification rate.
- This also holds approximately if there exists a low-error hiring policy.
Challenge: Heterogeneity across groups
- There exists a zero-error hiring policy for each group separately but not together.
- Then two types of equilibria exist:
- Only one group reaches the optimal qualification rate (unbalanced) - stable
- Both groups have the same qualification rate - unstable
- Almost never converge to a "balanced" long term outcome, even if you started close to one!
Interventions
- Decoupling
- Always helps in the group-realizable setting: not only does it not decrease any group’s equilibrium qualification rate
- It also increases the equilibrium qualification rate of at least one group when realizability across all groups does not hold
- When group-realizability does not hold, we see that in some cases decoupling is still helpful while in others it can significantly harm one group
- Subsidizing the cost of investment in a disadvantaged group
- Stable unbalanced
- Unstable more balanced
- In the non-realizable setting, it also improves the quality of the equilibria. However, the new equilibrium is not guaranteed to be locally stable
Case-study: FICO, etc.
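A generic sketch of the feedback loop as fixed-point iteration between the institution's and the individuals' best responses (an abstraction of mine; `accept_prob` and `invest_response` stand in for the paper's model):

```python
import numpy as np

def best_response_dynamics(accept_prob, invest_response, pi0, n_rounds=200):
    """Iterate institution/individual best responses until a fixed point.

    accept_prob(pi)      : institution's best-response acceptance policy given
                           the current per-group qualification rates pi
    invest_response(acc) : fraction of each group for which investing in
                           qualifications is worthwhile under that policy
    pi0                  : initial qualification rate per group
    """
    pi = np.asarray(pi0, dtype=float)
    for _ in range(n_rounds):
        acc = accept_prob(pi)           # institution best-responds to pi
        pi_next = invest_response(acc)  # individuals best-respond to the policy
        if np.allclose(pi_next, pi, atol=1e-6):
            break                       # reached a fixed point (an equilibrium)
        pi = np.asarray(pi_next, dtype=float)
    return pi
```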
Privacy
:fountain_pen: Fair decision making using privacy-protected data (Duke, UMass)
- Setting: Sensitive personal data is used to decide who will receive resources or benefits
- RQ: Impact of differential privacy on fair and equitable decision-making
- Real-world scenarios
Takeaways:
- If decisions are made using an ϵ-differentially private version of the data, under strict privacy constraints (smaller ϵ), the noise added to achieve privacy may disproportionately impact some groups over others
- Designers of privacy algorithms must evaluate the fairness of outcomes, in addition to conventional aggregate error metrics that have historically been their focus
- Optimizing for aggregate error on published statistics does not reliably lead to more accurate or fair outcomes for a downstream decision problem (DAWA vs. Laplace).
Scenario I: Minority-language voting rights
- Task: Binary decision rule based on district statistics (which are released ϵ-DP)
- Fairness metric: equality across jurisdictions of P(covered | data) (the randomness comes from the DP algorithm)
- Findings:
- There are significant disparities in the rate of correct classification across jurisdictions
- Significant differences in the rate of successful classification across jurisdictions is a consequence of the decision rule and its interaction with the noise added for privacy
- A jurisdiction's distance from the nearest threshold explains classification rates under D-Laplace but not under DAWA (which exacerbates disparities); see the sketch below
Mitigating unfairness: estimate the posterior probability that the jurisdiction is Covered given the observed noisy counts, and set a threshold (trade-off between FP and FN).
Scenario II: Title I funds allocation
At least $675 billion relies on data released by the Census Bureau.
Scenario III: Apportionment of legislative representatives
Data- and Workload-Aware (DAWA): introduces more complex, data-dependent noise that is adapted to the input data.
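A small illustration of why distance to the decision threshold drives classification rates under DP noise (plain Laplace mechanism; the counts and threshold are made up):

```python
import numpy as np

def dp_coverage_decision(true_count, threshold, epsilon, sensitivity=1.0,
                         n_trials=10000, seed=0):
    """Probability that a threshold decision on an epsilon-DP noisy count
    matches the decision on the true count (Laplace mechanism)."""
    rng = np.random.default_rng(seed)
    noisy = true_count + rng.laplace(scale=sensitivity / epsilon, size=n_trials)
    true_decision = true_count >= threshold
    return float(np.mean((noisy >= threshold) == true_decision))

# Jurisdictions far from the threshold are classified correctly almost always;
# those near it become coin flips under strict privacy (small epsilon):
# dp_coverage_decision(true_count=10050, threshold=10000, epsilon=0.1)
# dp_coverage_decision(true_count=10005, threshold=10000, epsilon=0.1)
```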
Domain adaptation
:fountain_pen: Fairness Warnings and Fair-MAML: Learning Fairly with Minimal Data (UCI, Haverford)
- Fairness Warnings
- Model-agnostic algorithm that provides interpretable boundary conditions for when a fairly trained model may not behave fairly on similar but slightly different tasks within a given domain.
- Train an interpretable model to predict which mean shifts of the data cause a classifier to become unfair.
- Fair-MAML
- A fair meta-learning approach to train models that can be quickly fine-tuned to specific tasks from only a small number of sample instances while balancing fairness and accuracy (a toy fairness-regularized task loss is sketched below)
- K-shot fairness, i.e. training a fair model on a new task with only K data points.
- There is more in the paper
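A sketch of the kind of fairness-regularized task loss a Fair-MAML-style inner adaptation step could use (the demographic-parity penalty and the gamma trade-off are assumptions of this sketch; the paper also considers other regularizers):

```python
import torch

def fair_task_loss(model, x, y, group, gamma=1.0):
    """Cross-entropy plus a demographic-parity-style penalty, usable both in
    the inner K-shot adaptation step and in the meta-update.

    x: features; y: class labels (long tensor); group: tensor in {0, 1}
    (assumes both groups are present in the batch); gamma: fairness weight.
    """
    logits = model(x)                                   # (n, 2) class logits
    ce = torch.nn.functional.cross_entropy(logits, y)
    p_pos = torch.softmax(logits, dim=1)[:, 1]          # P(positive outcome)
    parity_gap = (p_pos[group == 0].mean() - p_pos[group == 1].mean()).abs()
    return ce + gamma * parity_gap
```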
Ranking
:fountain_pen: Interventions for ranking in the presence of implicit bias (Yale, IIT Kanpur)
Subset selection task (shortlisting candidates for an interview)
- Rooney Rule: Select at least one candidate from an underprivileged group for an interview
- The rule increases the total utility of the selection [Kleinberg & Raghavan 18]
Ranking task
- Individuals placed later in the ranking are less likely to receive positive outcomes.
Model:
- (Unknown) Latent utility
- (Known) Position-based discount (e.g., DCG, Zipfian)
- (Unknown) Implicit bias factor
- (Known) Observed utility
Output ranking: the ranking maximizing the weighted sum of observed utilities (the number of ranked items can be less than the whole list).
Goal: Design constraints on the output ranking such that it has high latent utility.
L-constraint: a lower bound on the number of items from a particular group in the top-k positions of the ranking, for every position k in the output.
Theoretical results
- The class of L-constraints defined above is expressive enough to recover the optimal latent utility while optimizing observed utility, when constrained to a certain specific L that depends on the latent utility vector, for all implicit bias parameters.
- Rooney rule generalization: given \(\alpha \in [0,1]\), the constraint \(L(\alpha)\) is defined as follows: for all \(k \in [n]\), \(L_{ka}=0\) and \(L_{kb}=\alpha k\) (a greedy sketch follows the case studies below).
Under natural distributional assumptions on the utilities of items, surprisingly, these constraints can recover almost all of the utility lost due to implicit biases
These constraints appear to be robust to deviations from our assumptions.
Case studies
- IIT-JEE (2009) dataset
- Semantic Scholar Research corpus
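A greedy sketch of enforcing the generalized Rooney-rule constraint \(L_{kb}=\alpha k\) while ranking by observed utility (the greedy construction and the 'a'/'b' group labels are my simplifications, not the paper's algorithm):

```python
import math

def rank_with_rooney_constraint(items, alpha):
    """Build a ranking so that by position k at least ceil(alpha*k) items come
    from the underrepresented group 'b', otherwise pick by observed utility.

    items: list of (observed_utility, group) tuples with group in {'a', 'b'}.
    """
    remaining = sorted(items, key=lambda t: -t[0])   # best observed utility first
    ranking, n_b = [], 0
    for k in range(1, len(items) + 1):
        need_b = math.ceil(alpha * k) > n_b          # constraint binds at position k
        pool = [it for it in remaining if it[1] == 'b'] if need_b else remaining
        pick = pool[0] if pool else remaining[0]     # fall back if group b exhausted
        remaining.remove(pick)
        ranking.append(pick)
        n_b += pick[1] == 'b'
    return ranking

# Example: with alpha = 0.3, a slot is reserved for the best remaining group-b
# candidate whenever the running constraint would otherwise be violated.
# rank_with_rooney_constraint([(0.9, 'a'), (0.8, 'a'), (0.7, 'b'), (0.6, 'b')], 0.3)
```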
Case-study
YouTube Radicalization
Auditing radicalization pathways on YouTube (EPFL, UFMG)
- Large-scale audit of user radicalization on YouTube
349 channels, 330k videos, 72M+ comments
- Four groups: Media, the Alt-lite, the Intellectual Dark Web (I.D.W.), and the Alt-right.
Results:
- The four groups share the same user base;
- Users consistently migrate from milder to more extreme content
- Probing the recommendation system: Alt-lite content is easily reachable from I.D.W. channels, while Alt-right videos are reachable only through channel recommendations.
- Take with a grain of salt: does it really affect people's opinions and actions? Is the pipeline actually caused by the recommendation system?
Health
"The human body is a black box": supporting clinical decision-making with deep learning (Duke, Data & Society)
- Sepsis Watch, a machine learning-driven tool that assists hospital clinicians in the early diagnosis and treatment of sepsis.
- Sepsis is an inflammatory response to infection that can lead to organ failure and is the leading cause of inpatient deaths in US hospitals. It is not only hard to predict but also lacks a universally accepted definition.
- Considered the development of Sepsis Watch as the development of a sociotechnical system, not an isolated model.
- The model generates risk scores every hour for every adult patient to detect sepsis.
- How to build trust without ground truth? Not through interpretability, but through a lot of work with all the stakeholders.
Criminal Justice System
The impact of overbooking on a pre-trial risk assessment tool (Human Rights Data Analysis Group)
- With San Francisco's newly elected District Attorney
- Data from a pilot run of a pre-trial risk assessment tool
- Overbooking = booking charges that do not result in a conviction
- Booking charges that do not result in a conviction (i.e. charges that are dropped or end in an acquittal) increased the recommended level of pre-trial supervision in around 27% of cases evaluated by the tool.
- The ultimate objective of this analysis is to assess how often “unfair" booking charges caused the PSA to recommend excessively restrictive conditions of pre-trial supervision
- Dynamic pattern of cascading disadvantage
- Perform counterfactual analysis (booking charges vs. conviction charges)
- Disaggregating the analysis by race shows that while Black individuals received unwarranted charge-based exclusions and NVCA flags at a higher rate than non-Black individuals, they did not receive increased recommendations at a substantially higher rate due to the fact that Black individuals were more likely to be classified in the higher risk groups even before charge-based increases are applied.
Data in New Delhi's predictive policing system (Article 19, Jawaharlal Nehru University)
- In-situ ethnographic study of New Delhi Police’s data collection practices (30M people)
- Crime Mapping, Analytics and Predictive System (CMAPS)
- Offering methodological considerations for studying AI deployments in non-western contexts.
- Analysis
- Bias
- Call takers resort to standardized questions about the location of the caller and do not enquire further because they are incentivized to be quick more than they are incentivized to be accurate.
- A high volume of calls might not be an indicator of high crime but a lack of access to other sections of governance for these urban poor.
- Green Diary draft with a ‘pending’ status.
- Disparate impact, or indirect discrimination
- Direct discrimination
- Hard coding arbitrariness
- Only the more severe crime is taken into consideration, which undermines the accuracy of this data as an indicator of the frequency of crimes across the spectrum
- Opacity as a feature, not a bug
Targeted social polices
Algorithmic Targeting of Social Policies: Fairness, Accuracy, and Distributed Governance (MIT)
- Targeted social policies are the main strategy for poverty alleviation across the developing world.
- Due to their scale, diversity, and widespread relevance, these are among the most important algorithms operating in the world today.
- Improved an eligibility system using ML, with impact on ~1M people in two Latin American countries.
- Substantially increased accuracy.
- Absent explicit parity constraints, both status quo and AI-based systems induce disparities across population subgroups.
- Worked on tackling the lack of consensus on normative standards for prioritization and fairness criteria with a decision-support platform for distributed governance
Hiring
Mitigating bias in algorithmic hiring: evaluating claims and practices (Cornell)
- Documenting and analyzing the claims and practices of companies offering algorithms for employment assessment.
- Technically, we consider the various choices vendors make regarding data collection and prediction targets, and explore the risks and trade-offs that these choices pose. We also discuss how algorithmic de-biasing techniques interface with, and create challenges for, antidiscrimination law.
Explainability
The Hidden Assumptions Behind Counterfactual Explanations and Principal Reasons (MSR, UCLA, Cornell)
Highlighting subsets of features in the service of autonomy
- Counterfactual explanations
- Motivated by the GDPR
- The goal is to provide actionable guidance - to explain how things could have been different and provide a concrete set of steps a consumer might take to achieve a different outcome in the future.
- Counterfactual explanations are generated by identifying the features that, if minimally changed, would alter the output of the model (a toy search is sketched at the end of these notes).
- Principal reason
- Motivated by credit regulation in the US
- "Adverse action notices" (ANN)
- What counts as a principal reason is not well-defined in either the statutes or regulation.
- Principal reasons are satisfied by a broader array of possible feature-highlighting explanations.
Hidden assumptions behind the belief that these explanations will be useful for decision subjects
- Features do not clearly map to actions
- Features cannot be made commensurate by looking only at the distribution of the training data (i.e. what is the distance function?)
- Features may be relevant to decision making in multiple domains
- Models may not have certain properties: stability, monotonicity, and binary outcomes
Unavoidable Tensions
- The autonomy paradox
- The burden and power to choose
- Too much transparency
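Referring back to how counterfactual explanations are generated: a toy single-feature search for illustration (hypothetical helper; real methods optimize over an explicit distance function, which is exactly what the hidden assumptions above concern):

```python
import numpy as np

def greedy_counterfactual(model, x, step_sizes, max_multiple=20):
    """Try progressively larger changes to one feature at a time until the
    model's binary decision flips; step_sizes stands in for the distance
    function that real counterfactual methods must choose.

    model : callable returning 0/1 for a 1-D feature vector
    x     : the instance that received the unfavorable decision
    """
    x = np.asarray(x, dtype=float)
    steps = np.asarray(step_sizes, dtype=float)
    original = model(x)
    for m in range(1, max_multiple + 1):          # growing magnitude of change
        for i in range(len(x)):
            for sign in (+1, -1):
                candidate = x.copy()
                candidate[i] += sign * m * steps[i]
                if model(candidate) != original:
                    return i, candidate           # feature to change + new values
    return None                                   # no single-feature flip found
```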
Data
Lessons from archives: strategies for collecting sociocultural data in machine learning (Stanford, Google)
- FATE issues are rooted in decisions surrounding the data collection and annotation process.
- New specialization: data collection and annotation
Archives are the longest standing communal effort to gather human information and archive scholars have already developed the language and procedures to address and discuss many challenges pertaining to data collection.
- "Wild west" Web crawling vs. Community archives vs. Curatorial archives
Lessons from Archives
- Inclusivity: mission statements & collection policies
- Consent: community & participatory archives
- Power: data consortia
- Transparency: appraisal records & committee-based data collection
- Ethics & privacy: codes of ethics and conduct
Case study: GPT-2 and Reddit
Garbage in, garbage out?: do machine learning application papers in social computing report where human-labeled training data comes from? (UC Berkeley)
- Does "gold standard" data is reliable in the first place?
Labeling - similar to structured content analysis
- Longstanding methodology in the social sciences and humanities, with many established best practices
Examined ML papers that used Twitter data, asking e.g.:
- Were such best practices followed?
- Who were the labelers?
- How did they label?
- Inter-rater reliability metrics
- Compensation for crowdworkers
Findings:
- They indicate concern, given how crucial the quality of training data is and how difficult it is to standardize human judgment.
- Yet they also give us hope, as we found a number of papers we considered to be excellent cases of reporting the processes behind their datasets.
Towards fairer datasets: filtering and balancing the distribution of the people subtree in the ImageNet hierarchy (Princeton)
Focus on the person subtree in ImageNet
ImageNet data collection pipeline:
- Concept vocabulary (WordNet)
- Candidate images (Search engine)
- Manual cleanup (AMT)
Identified three key factors that may lead to problematic behavior in downstream technology:
- The stagnant concept vocabulary from WordNet (unsafe)
- The attempt at exhaustive illustration of all categories with images (imageability)
- The inequality of demographic representation in the images (gender, skin color, age)
Blog post: