- CIFAR and ILSVRC training code with JIT compiling and distributed learning on multi-GPU systems.
- I highly recommend using JIT compiling: most of the algorithm is static and can be compiled, which reduces memory usage and improves training speed.
- This repository is built from custom layers and a custom training loop for my project, but if you only want to see how to use JIT compiling with distributed learning, check 'train.py' and 'op_util.py'; a minimal sketch of the pattern follows below.
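The sketch below is NOT the repository's actual code: a generic Keras ResNet-50 and SGD stand in for the custom layers and training loop. It shows the combination this repo demonstrates: the static per-replica forward/backward pass is XLA-compiled with `tf.function(jit_compile=True)`, while the optimizer update and cross-replica reduction stay outside the compiled scope, all under `tf.distribute.MirroredStrategy`.

```python
# Minimal sketch (assumed stand-in model/optimizer, not this repo's code).
import tensorflow as tf

strategy = tf.distribute.MirroredStrategy()  # one replica per visible GPU

with strategy.scope():
    # Model returns raw logits so the loss applies softmax itself.
    model = tf.keras.applications.ResNet50(
        weights=None, classes=1000, classifier_activation=None)
    optimizer = tf.keras.optimizers.SGD(learning_rate=0.1, momentum=0.9)
    # Per-example losses; averaging over the global batch is done manually.
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(
        from_logits=True, reduction=tf.keras.losses.Reduction.NONE)

@tf.function(jit_compile=True)  # XLA-compile the static forward/backward pass
def forward_backward(images, labels):
    with tf.GradientTape() as tape:
        logits = model(images, training=True)
        per_example_loss = loss_fn(labels, logits)
        loss = tf.nn.compute_average_loss(per_example_loss)
    grads = tape.gradient(loss, model.trainable_variables)
    return loss, grads

def train_step(images, labels):
    loss, grads = forward_backward(images, labels)
    # The optimizer update (with cross-replica gradient aggregation)
    # stays outside the XLA-compiled function.
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss

@tf.function
def distributed_train_step(images, labels):
    per_replica_loss = strategy.run(train_step, args=(images, labels))
    return strategy.reduce(
        tf.distribute.ReduceOp.SUM, per_replica_loss, axis=None)
```

Feed it with a dataset distributed via `strategy.experimental_distribute_dataset`; since an XLA-compiled function is retraced for each new input shape, fixed-size batches keep recompilation to a minimum.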
- TensorFlow >= 2.5
- Pillow
- ILSVRC
python train.py --compile --gpu_id {} --dataset ILSVRC --data_path /path/to/your/ILSVRC/home --train_path /path/to/log
- CIFAR{10,100}
python train.py --compile --gpu_id {} --dataset CIFAR{10,100} --train_path /path/to/log
- I used four GTX 1080 Ti GPUs.
- JIT compiling reduces training time by about 40%.
| | Accuracy (%) | Training time |
|---|---|---|
| Distributed only | 75.83 | 94.61 |
| Distributed with JIT | 75.57 | 56.98 |