Opacus-fusion

Opacus-fusion is an extension of PyTorch Opacus. It allows an fast an efficient DP-SGD training via example-wise weight gradient computation and adaptive clipping.

Prerequisite

CUDA Toolkit, CUDNN, CUBLAS should be installed.

Installation

Set envrionment variables

export ENV_NAME={ENV_NAME}
export OPACUS_FUSION_PATH={OPACUS_FUSION_PATH} # Absolute path
export CUTLASS_PATH={CUTLASS_PATH} # Absolute path

Create conda envrionment

conda create -n $ENV_NAME python=3.9
conda activate $ENV_NAME

Install torch from https://pytorch.org/get-started/locally/
Download opacus-fusion from https://github.com/parkbeomsik/opacus-fusion

git clone https://github.com/parkbeomsik/opacus-fusion.git $OPACUS_FUSION_PATH

Download cutlass from https://github.com/parkbeomsik/cutlass

git clone https://github.com/parkbeomsik/cutlass.git $CUTLASS_PATH

Install cutlass_wgrad_grouped (It will create lib and include in build directory)

cd $OPACUS_FUSION_PATH
cd cutlass_wgrad_grouped
mkdir build && cd build
cmake .. -DCUTLASS_PATH=$CUTLASS_PATH
make install

Install grad_example_module

cd $OPACUS_FUSION_PATH
cd grad_example_module
python setup.py install

Install custom_rnn

cd $OPACUS_FUSION_PATH
cd custom_rnn
python setup.py install

Install opacus-fusion

cd $OPACUS_FUSION_PATH
pip install -e .

Run

Profile time for all cases

export PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:512
cd $OPACUS_FUSION_PATH/examples
python benchmark_scripts/profile_time_all.py

Profile time

export PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:512
cd $OPACUS_FUSION_PATH/examples
python benchmark.py --input_size 32 --model_type cnn --architecture resnet18 --dpsgd_mode naive --batch_size 16 --profile_time # DPSGD
python benchmark.py --input_size 32 --model_type cnn --architecture resnet18 --dpsgd_mode reweight --batch_size 16 --profile_time # DPSGD(R)
python benchmark.py --input_size 32 --model_type cnn --architecture resnet18 --dpsgd_mode elegant --batch_size 16 --profile_time # Proposed

Profile memory

cd $OPACUS_FUSION_PATH/examples
python benchmark.py --input_size 32 --model_type cnn --architecture resnet18 --dpsgd_mode naive --batch_size 16 --profile_memory --warm_up_steps 0 --steps 1 # DPSGD
python benchmark.py --input_size 32 --model_type cnn --architecture resnet18 --dpsgd_mode reweight --batch_size 16 --profile_memory --warm_up_steps 0 --steps 1 # DPSGD(R)
python benchmark.py --input_size 32 --model_type cnn --architecture resnet18 --dpsgd_mode elegant --batch_size 16 --profile_memory --warm_up_steps 0 --steps 1 # Proposed

Name		Name	Last commit message	Last commit date
Latest commit History 710 Commits
custom_rnn		custom_rnn
cutlass_wgrad_grouped		cutlass_wgrad_grouped
examples		examples
grad_example_module		grad_example_module
opacus		opacus
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Migration_Guide.md		Migration_Guide.md
README.md		README.md
conftest.py		conftest.py
dev_requirements.txt		dev_requirements.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Opacus-fusion

Prerequisite

Installation

Run

Profile time for all cases

Profile time

Profile memory

About

Releases

Packages

Languages

License

parkbeomsik/opacus-fusion

Folders and files

Latest commit

History

Repository files navigation

Opacus-fusion

Prerequisite

Installation

Run

Profile time for all cases

Profile time

Profile memory

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages