Please enable JavaScript.

Coggle requires JavaScript to display documents.

Model fingerprinting A fingerprint is a piece of information extracted…

- - - - :bulb: show reparametrization destroys this
- - - - Adversarial fingerprinting
        Dominant paradigm. Adversarial sampling exploits the intuition that models tend to be characterized by their decision-boundary. Requires to compute gradients of victim model m.
        [AFA: Adversarial fingerprinting authentication for deep neural networks]
        [TAFA: A Task-Agnostic Fingerprinting Algorithm for Neural Networks]
        Distance between functions using pairs of (x, x_adversarial) [ModelDiff: Testing-Based DNN Similarity Comparison for Model Reuse Detection]
        Using Universarl Adversarial Perturbations [Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations]
        Using conferrable adversarial examples, robustness against extraction [Deep Neural Network Fingerprinting by Conferrable Adversarial Examples]
        Basic strategy using DeepFool [Fingerprinting Deep Neural Networks - a DeepFool Approach]
        DeepJudge [Copy, Right? A Testing Framework for Copyright Protection of Deep Learning Models]
  - - - :bulb: Functional mode connectivity
        If we push the idea further and quotient all symmetries, it should further enhance the comparability between models (ex: how many "real functional modes" are there after a training procedure, marking qualitative differences between implemented functions)
- - - - [Deep Neural Network Fingerprinting by Conferrable Adversarial Examples] is a fingerprinting method robust against extraction
  - - - Finetuning
        Fine-truning can erase hidden signal (backdoor, watermarks) [Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks], but some signal can also be retained [BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain]
      - Model compression
        Techniques to shrink a NN
        
        Model quantization [Model Quantization 1: Basic Concepts]
        
        Pruning
        Fine-Pruning can erase hidden signal(backdoor, watermarks) [Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks]
        [Pruning Filters for Efficient ConvNets]
        
        Knowledge distillation
        Large trained model as teacher, smaller (different architecture) model trained to mimic teacher behavior. We can imagine going beyon raw supervision (prediction only) to include logits, intermediate representations etc
      - Reparametrization
      - Adversarial training
        Few works consider adversarial training in the attacker toolbox, it probably has high evading power
        Considers adv training [Are You Stealing My Model? Sample Correlation for Fingerprinting Deep Neural Networks]
- - - - :bulb: Freeze almost everything and finetune
        Select some weights, such that modifying these weights has almost the least effect on the function. Then, install the backdoor by freezing all other weights.
      - Most general case for the attacker: replace m_1 with m_2 which is functionally equivalent to m_1 on any dataset, except for a backdoor
        If the user has access to (A) both weights it is trivial (B) only has API access it seems impossible even if [Sensitive-Sample Fingerprinting of Deep Neural Networks] argues it has solved the problem.
        :bulb: Show that (B) is impossible by showing that it is possible to run a pre-classifier to chose between using the original model or the corrupted model, and that if it is well designed, it is impossible to make the API tap into the corrupted model. This is all about hiding and picking up a cue.
        Theorem 1 in [High Accuracy and High Fidelity Extraction of Neural Networks] might help showing that a backdoor can easily be undetectable