transformers/docker/transformers-all-latest-gpu/Dockerfile

FROM nvidia/cuda:11.2.2-cudnn8-devel-ubuntu20.04
LABEL maintainer="Hugging Face"

ARG DEBIAN_FRONTEND=noninteractive

# Use login shell to read variables from `~/.profile` (to pass dynamic created variables between RUN commands)
SHELL ["sh", "-lc"]

# The following `ARG` are mainly used to specify the versions explicitly & directly in this docker file, and not meant
# to be used as arguments for docker build (so far).

ARG PYTORCH='1.12.0'
# (not always a valid torch version)
ARG INTEL_TORCH_EXT='1.11.0'
# Example: `cu102`, `cu113`, etc.
ARG CUDA='cu113'

RUN apt update
RUN apt install -y git libsndfile1-dev tesseract-ocr espeak-ng python3 python3-pip ffmpeg git-lfs
RUN git lfs install
RUN python3 -m pip install --no-cache-dir --upgrade pip

ARG REF=main
RUN git clone https://github.com/huggingface/transformers && cd transformers && git checkout $REF
RUN python3 -m pip install --no-cache-dir -e ./transformers[dev,onnxruntime]

# TODO: Handle these in a python utility script
RUN [ ${#PYTORCH} -gt 0 -a "$PYTORCH" != "pre" ] && VERSION='torch=='$PYTORCH'.*' ||  VERSION='torch'; echo "export VERSION='$VERSION'" >> ~/.profile
RUN echo torch=$VERSION
# `torchvision` and `torchaudio` should be installed along with `torch`, especially for nightly build.
# Currently, let's just use their latest releases (when `torch` is installed with a release version)
# TODO: We might need to specify proper versions that work with a specific torch version (especially for past CI).
RUN [ "$PYTORCH" != "pre" ] && python3 -m pip install --no-cache-dir -U $VERSION torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/$CUDA || python3 -m pip install --no-cache-dir -U --pre torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/$CUDA

RUN python3 -m pip install --no-cache-dir -U tensorflow
RUN python3 -m pip uninstall -y flax jax

# Use installed torch version for `torch-scatter` to avid to deal with PYTORCH='pre'.
# If torch is nightly version, the link is likely to be invalid, but the installation falls back to the latest torch-scatter
RUN python3 -m pip install --no-cache-dir torch-scatter -f https://data.pyg.org/whl/torch-$(python3 -c "from torch import version; print(version.__version__.split('+')[0])")+$CUDA.html
RUN python3 -m pip install --no-cache-dir intel_extension_for_pytorch==$INTEL_TORCH_EXT+cpu -f https://software.intel.com/ipex-whl-stable

RUN python3 -m pip install --no-cache-dir git+https://github.com/facebookresearch/detectron2.git pytesseract https://github.com/kpu/kenlm/archive/master.zip
RUN python3 -m pip install -U "itsdangerous<2.1.0"

RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/accelerate@main#egg=accelerate

# Add bitsandbytes for mixed int8 testing
RUN python3 -m pip install -i https://test.pypi.org/simple/ bitsandbytes==0.31.5

RUN python3 -m pip install --no-cache-dir decord

# When installing in editable mode, `transformers` is not recognized as a package.
# this line must be added in order for python to be aware of transformers.
RUN cd transformers && python3 setup.py develop
Dcoker images runtime -> devel (#16141) * Runtime -> Devel * Torch before DeepSpeed 2022-03-15 00:37:20 +08:00			`FROM nvidia/cuda:11.2.2-cudnn8-devel-ubuntu20.04`
[Test refactor 5/5] Build docker images (#15729) 2022-02-24 04:48:19 +08:00			`LABEL maintainer="Hugging Face"`

			`ARG DEBIAN_FRONTEND=noninteractive`

Enable PyTorch nightly build CI (#17335) * nightly build pytorch CI * fix working dir * change time and event name Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-17 22:42:27 +08:00			# Use login shell to read variables from `~/.profile` (to pass dynamic created variables between RUN commands)
			`SHELL ["sh", "-lc"]`

Explicit versions in docker files (#17586) * Update docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-08 21:04:22 +08:00			# The following `ARG` are mainly used to specify the versions explicitly & directly in this docker file, and not meant
			`# to be used as arguments for docker build (so far).`

PyTorch 1.12.0 for scheduled CI (#17949) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-30 01:32:19 +08:00			`ARG PYTORCH='1.12.0'`
Explicit versions in docker files (#17586) * Update docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-08 21:04:22 +08:00			`# (not always a valid torch version)`
			`ARG INTEL_TORCH_EXT='1.11.0'`
			# Example: `cu102`, `cu113`, etc.
			`ARG CUDA='cu113'`

[Test refactor 5/5] Build docker images (#15729) 2022-02-24 04:48:19 +08:00			`RUN apt update`
CLI: add stricter automatic checks to `pt-to-tf` (#17588) * Stricter pt-to-tf checks; Update docker image for related tests * check all attributes in the output Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> 2022-06-08 17:45:10 +08:00			`RUN apt install -y git libsndfile1-dev tesseract-ocr espeak-ng python3 python3-pip ffmpeg git-lfs`
			`RUN git lfs install`
[Test refactor 5/5] Build docker images (#15729) 2022-02-24 04:48:19 +08:00			`RUN python3 -m pip install --no-cache-dir --upgrade pip`

Rename master to main for notebooks links and leftovers (#16397) 2022-03-25 21:12:23 +08:00			`ARG REF=main`
[Test refactor 5/5] Build docker images (#15729) 2022-02-24 04:48:19 +08:00			`RUN git clone https://github.com/huggingface/transformers && cd transformers && git checkout $REF`
			`RUN python3 -m pip install --no-cache-dir -e ./transformers[dev,onnxruntime]`

Enable PyTorch nightly build CI (#17335) * nightly build pytorch CI * fix working dir * change time and event name Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-17 22:42:27 +08:00			`# TODO: Handle these in a python utility script`
			`RUN [ ${#PYTORCH} -gt 0 -a "$PYTORCH" != "pre" ] && VERSION='torch=='$PYTORCH'.*' \|\| VERSION='torch'; echo "export VERSION='$VERSION'" >> ~/.profile`
			`RUN echo torch=$VERSION`
			# `torchvision` and `torchaudio` should be installed along with `torch`, especially for nightly build.
			# Currently, let's just use their latest releases (when `torch` is installed with a release version)
			`# TODO: We might need to specify proper versions that work with a specific torch version (especially for past CI).`
			`RUN [ "$PYTORCH" != "pre" ] && python3 -m pip install --no-cache-dir -U $VERSION torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/$CUDA \|\| python3 -m pip install --no-cache-dir -U --pre torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/$CUDA`

Use latest stable PyTorch/DeepSpeed for Push & Scheduled CI (#17417) * update versions Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-07 17:53:05 +08:00			`RUN python3 -m pip install --no-cache-dir -U tensorflow`
[Test refactor 5/5] Build docker images (#15729) 2022-02-24 04:48:19 +08:00			`RUN python3 -m pip uninstall -y flax jax`
Use latest stable PyTorch/DeepSpeed for Push & Scheduled CI (#17417) * update versions Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-07 17:53:05 +08:00
Enable PyTorch nightly build CI (#17335) * nightly build pytorch CI * fix working dir * change time and event name Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-17 22:42:27 +08:00			# Use installed torch version for `torch-scatter` to avid to deal with PYTORCH='pre'.
			`# If torch is nightly version, the link is likely to be invalid, but the installation falls back to the latest torch-scatter`
			`RUN python3 -m pip install --no-cache-dir torch-scatter -f https://data.pyg.org/whl/torch-$(python3 -c "from torch import version; print(version.__version__.split('+')[0])")+$CUDA.html`
Explicit versions in docker files (#17586) * Update docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-08 21:04:22 +08:00			`RUN python3 -m pip install --no-cache-dir intel_extension_for_pytorch==$INTEL_TORCH_EXT+cpu -f https://software.intel.com/ipex-whl-stable`
Use latest stable PyTorch/DeepSpeed for Push & Scheduled CI (#17417) * update versions Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-07 17:53:05 +08:00
[Test refactor 5/5] Build docker images (#15729) 2022-02-24 04:48:19 +08:00			`RUN python3 -m pip install --no-cache-dir git+https://github.com/facebookresearch/detectron2.git pytesseract https://github.com/kpu/kenlm/archive/master.zip`
			`RUN python3 -m pip install -U "itsdangerous<2.1.0"`

install dev. version of accelerate (#17243) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-05-14 01:47:09 +08:00			`RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/accelerate@main#egg=accelerate`

`bitsandbytes` - `Linear8bitLt` integration into `transformers` models (#17901) * first commit * correct replace function * add final changes - works like charm! - cannot implement tests yet - tested * clean up a bit * add bitsandbytes dependencies * working version - added import function - added bitsandbytes utils file * small fix * small fix - fix import issue * fix import issues * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor a bit - move bitsandbytes utils to utils - change comments on functions * reformat docstring - reformat docstring on init_empty_weights_8bit * Update src/transformers/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * revert bad formatting * change to bitsandbytes * refactor a bit - remove init8bit since it is useless * more refactoring - fixed init empty weights issue - added threshold param * small hack to make it work * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * revmoe the small hack * modify utils file * make style + refactor a bit * create correctly device map * add correct dtype for device map creation * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply suggestions - remove with torch.grad - do not rely on Python bool magic! * add docstring - add docstring for new kwargs * add docstring - comment `replace_8bit_linear` function - fix weird formatting * - added more documentation - added new utility function for memory footprint tracking - colab demo to add * few modifs - typo doc - force cast into float16 when load_in_8bit is enabled * added colab link * add test architecture + docstring a bit * refactor a bit testing class * make style + refactor a bit * enhance checks - add more checks - start writing saving test * clean up a bit * male style * add more details on doc * add more tests - still needs to fix 2 tests * replace by "or" - could not fix it from GitHub GUI Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor a bit testing code + add readme * make style * fix import issue * Update src/transformers/modeling_utils.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * add few comments * add more doctring + make style * more docstring * raise error when loaded in 8bit * make style * add warning if loaded on CPU * add small sanity check * fix small comment * add bitsandbytes on dockerfile * Improve documentation - improve documentation from comments * add few comments * slow tests pass on the VM but not on the CI VM * Fix merge conflict * make style * another test should pass on a multi gpu setup * fix bad import in testing file * Fix slow tests - remove dummy batches - no more CUDA illegal memory errors * odify dockerfile * Update docs/source/en/main_classes/model.mdx * Update Dockerfile * Update model.mdx * Update Dockerfile * Apply suggestions from code review * few modifications - lm head can stay on disk/cpu - change model name so that test pass * change test value - change test value to the correct output - torch bmm changed to baddmm in bloom modeling when merging * modify installation guidelines * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace `n`by `name` * merge `load_in_8bit` and `low_cpu_mem_usage` * first try - keep the lm head in full precision * better check - check the attribute `base_model_prefix` instead of computing the number of parameters * added more tests * Update src/transformers/utils/bitsandbytes.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Merge branch 'integration-8bit' of https://github.com/younesbelkada/transformers into integration-8bit * improve documentation - fix typos for installation - change title in the documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> 2022-08-10 15:13:36 +08:00			`# Add bitsandbytes for mixed int8 testing`
			`RUN python3 -m pip install -i https://test.pypi.org/simple/ bitsandbytes==0.31.5`

[VideoMAE] Add model to doc tests (#18523) * Add videomae to doc tests * Add pip install decord Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> 2022-08-09 01:28:51 +08:00			`RUN python3 -m pip install --no-cache-dir decord`

[Test refactor 5/5] Build docker images (#15729) 2022-02-24 04:48:19 +08:00			# When installing in editable mode, `transformers` is not recognized as a package.
			`# this line must be added in order for python to be aware of transformers.`
			`RUN cd transformers && python3 setup.py develop`