From 52595a96415d924afa1e9d0f24c8448e2092dbcd Mon Sep 17 00:00:00 2001 From: Ross Wightman Date: Tue, 31 Dec 2024 10:10:52 -0800 Subject: [PATCH] Update README.md --- README.md | 12 ++++++++++++ 1 file changed, 12 insertions(+) diff --git a/README.md b/README.md index bd03704c5..f9deb69cc 100644 --- a/README.md +++ b/README.md @@ -12,6 +12,18 @@ ## What's New +## Dec 31, 2024 +* Add AIM-v2 encoders from https://github.com/apple/ml-aim, see on Hub: https://huggingface.co/models?search=timm%20aimv2 +* Add PaliGemma2 encoders from https://github.com/google-research/big_vision to existing PaliGemma, see on Hub: https://huggingface.co/models?search=timm%20pali2 +* Add missing L/14 DFN2B 39B CLIP ViT, `vit_large_patch14_clip_224.dfn2b_s39b` +* Fix existing RmsProp layer to match standard formulation, use PT 2.5 impl when possible. Move old impl to `SimpleNorm` layer, it's LN w/o centering or bias. There were only two `timm` models using it, and they have been updated. +* Allow overidde of `cache_dir` arg for model creation +* Pass through `trust_remote_code` for HF datasets wrapper +* `inception_next_atto` model added by creator +* Adan optimizer caution, and Lamb decoupled weighgt decay options +* Some feature_info metadata fixed by https://github.com/brianhou0208 +* All OpenCLIP and JAX (CLIP, SigLIP, Pali, etc) model weights that used load time remapping were given their own HF Hub instances so that they work with `hf-hub:` based loading, and thus will work with new Transformers `TimmWrapperModel` + ## Nov 28, 2024 * More optimizers * Add MARS optimizer (https://arxiv.org/abs/2411.10438, https://github.com/AGI-Arena/MARS)