Windows wheel build error - Tried everything with all requirements you have #6871

Open
FurkanGozukara opened this issue Dec 14, 2024 · 6 comments
Labels: build (Improvements to the build and testing systems), windows (Questions or PRs relating to running DeepSpeed on Windows)

@FurkanGozukara

I need to compile DeepSpeed for Python 3.10.

I have literally everything set up. The compile ran for 20 minutes or more.

However, it failed later.

As Microsoft, forcing us to use DeepSpeed only on Linux is ridiculous, in anyone's opinion.

I have the Build Tools (2022) fully installed.

Image

I have Python 3.10.11 and CUDA 12.4 installed.

The first error is that you have to fix the git path in setup.py, since it works only on Linux. I did that:

# Write out version/git info.
git_hash_cmd = ["git", "rev-parse", "--short", "HEAD"]
git_branch_cmd = ["git", "rev-parse", "--abbrev-ref", "HEAD"]
if command_exists('git') and not is_env_set('DS_BUILD_STRING'):
    try:
        result = subprocess.check_output(git_hash_cmd, shell=True)
        git_hash = result.decode('utf-8').strip()
        result = subprocess.check_output(git_branch_cmd, shell=True)
        git_branch = result.decode('utf-8').strip()
    except subprocess.CalledProcessError:
        git_hash = "unknown"
        git_branch = "unknown"
else:
    git_hash = "unknown"
    git_branch = "unknown"
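
A note on the patch above: with `shell=True` and a list argument, `subprocess` on POSIX treats only the first list item as the command, so this change is likely Windows-only. A minimal sketch of a variant that avoids the shell on both platforms, assuming the goal is just a short hash and branch name with an "unknown" fallback, could look like this. The helper name `git_output` is hypothetical and is not part of DeepSpeed's setup.py.

```python
import shutil
import subprocess


def git_output(args, default="unknown"):
    """Run `git <args>` and return stripped stdout, or `default` on any failure.

    Hypothetical helper for illustration; not taken from DeepSpeed's setup.py.
    """
    # Look up the full path to the git executable on PATH, if there is one.
    git = shutil.which("git")
    if git is None:
        return default
    try:
        # Pass an argument list and no shell, so quoting rules do not differ by OS.
        result = subprocess.check_output([git, *args], stderr=subprocess.DEVNULL)
    except (subprocess.CalledProcessError, OSError):
        return default
    return result.decode("utf-8").strip() or default


git_hash = git_output(["rev-parse", "--short", "HEAD"])
git_branch = git_output(["rev-parse", "--abbrev-ref", "HEAD"])
print(git_hash, git_branch)
```

Outside a git checkout, or when git is not installed, both values fall back to "unknown", which matches the behavior of the original snippet.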

After that, the compile starts.

Image

Finally, the compile ends with the error below:

cl : Command line warning D9025 : overriding '/D__CUDA_NO_HALF2_OPERATORS__' with '/U__CUDA_NO_HALF2_OPERATORS__'
slice_attn_masks.cu
tmpxft_0000b2e4_00000000-7_slice_attn_masks.compute_90.cudafe1.cpp
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.4\include\cuda_fp16.hpp(992): fatal error C1001: Internal compiler error.
(compiler file 'D:\a\_work\1\s\src\vctools\Compiler\CxxFE\sl\p1\c\token_io.cpp', line 2687)
 To work around this problem, try simplifying or changing the program near the locations listed above.
If possible please provide a repro here: https://developercommunity.visualstudio.com
Please choose the Technical Support command on the Visual C++
 Help menu, or open the Technical Support help file for more information
error: command 'C:\\Program Files\\NVIDIA GPU Computing Toolkit\\CUDA\\v12.4\\bin\\nvcc.exe' failed with exit code 4294967295

It still failed. The full logs are attached below:

entire.logs.txt

My pip freeze is below. So are you going to either start publishing up-to-date precompiled wheels for Windows, or fix your script to run on Windows?

Microsoft Windows [Version 10.0.22631.4602]
(c) Microsoft Corporation. All rights reserved.

C:\Users\Furkan>pip freeze
ae-ffmpeg==1.2.0
aiofiles==24.1.0
aiohappyeyeballs==2.4.3
aiohttp==3.11.7
aiosignal==1.3.1
anyio==4.6.2.post1
argon2-cffi==23.1.0
argon2-cffi-bindings==21.2.0
arrow==1.3.0
asttokens==2.4.1
async-lru==2.0.4
async-timeout==5.0.1
attrs==24.2.0
auto-editor==25.3.1
babel==2.16.0
beautifulsoup4==4.12.3
bleach==6.1.0
Brotli==1.1.0
bs4==0.0.2
cachetools==5.5.0
certifi==2024.8.30
cffi==1.17.1
charset-normalizer==3.3.2
colorama==0.4.6
comm==0.2.2
cryptography==44.0.0
debugpy==1.8.7
decorator==4.4.2
defusedxml==0.7.1
discord==2.3.2
discord.py==2.4.0
exceptiongroup==1.2.2
executing==2.1.0
fastjsonschema==2.20.0
ffmpeg==1.4
ffmpeg-python==0.2.0
filelock==3.13.1
fire==0.7.0
fqdn==1.5.1
frozenlist==1.5.0
fsspec==2024.2.0
future==1.0.0
GPUtil==1.4.0
h11==0.14.0
hf_transfer==0.1.8
httpcore==1.0.6
httpx==0.27.2
huggingface-hub==0.26.2
idna==3.10
imageio==2.36.1
imageio-ffmpeg==0.5.1
imutils==0.5.4
inputimeout==1.0.4
ipykernel==6.29.5
ipython==8.28.0
ipywidgets==8.1.5
iso8601==2.1.0
isoduration==20.11.0
jedi==0.19.1
Jinja2==3.1.3
json5==0.9.25
jsonpointer==3.0.0
jsonschema==4.23.0
jsonschema-specifications==2024.10.1
jupyter-events==0.10.0
jupyter-lsp==2.2.5
jupyter_client==8.6.3
jupyter_core==5.7.2
jupyter_server==2.14.2
jupyter_server_terminals==0.5.3
jupyterlab==4.2.5
jupyterlab_pygments==0.3.0
jupyterlab_server==2.27.3
jupyterlab_widgets==3.0.13
llvmlite==0.43.0
markdown-it-py==3.0.0
MarkupSafe==2.1.5
matplotlib-inline==0.1.7
mdurl==0.1.2
mistune==3.0.2
more-itertools==10.5.0
moviepy==1.0.3
mpmath==1.3.0
multidict==6.1.0
mutagen==1.47.0
natsort==8.4.0
nbclient==0.10.0
nbconvert==7.16.4
nbformat==5.10.4
nest-asyncio==1.6.0
networkx==3.2.1
ninja==1.11.1.2
notebook_shim==0.2.4
numba==0.60.0
numpy==1.26.3
nvidia-ml-py==12.535.161
nvitop==1.3.2
openai-whisper @ git+https://github.com/openai/whisper.git@90db0de1896c23cbfaf0c58bc2d30665f709f170
opencv-contrib-python==4.10.0.84
opencv-python==4.10.0.84
overrides==7.7.0
packaging==24.1
pandocfilters==1.5.1
parso==0.8.4
pillow==10.2.0
pillow_heif==0.20.0
platformdirs==4.3.6
proglog==0.1.10
prometheus_client==0.21.0
prompt_toolkit==3.0.48
propcache==0.2.0
psutil==6.0.0
pure_eval==0.2.3
py-cpuinfo==9.0.0
pyav==13.1.0
pycparser==2.22
pycryptodomex==3.21.0
Pygments==2.18.0
PyNaCl==1.5.0
python-dateutil==2.9.0.post0
python-dotenv==1.0.1
python-json-logger==2.0.7
pywin32==308
pywinpty==2.0.14
PyYAML==6.0.2
pyzmq==26.2.0
referencing==0.35.1
regex==2024.9.11
requests==2.32.3
rfc3339-validator==0.1.4
rfc3986-validator==0.1.1
rich==13.9.2
rpds-py==0.20.0
safetensors==0.4.5
Send2Trash==1.8.3
sentencepiece==0.2.0
six==1.16.0
sniffio==1.3.1
soupsieve==2.6
stack-data==0.6.3
sympy==1.13.1
termcolor==2.4.0
terminado==0.18.1
tiktoken==0.7.0
tinycss2==1.3.0
tokenizers==0.20.0
tomli==2.0.2
torch==2.5.0+cu124
torchao==0.7.0
torchaudio==2.5.0+cu124
torchvision==0.20.0+cu124
tornado==6.4.1
tqdm==4.66.5
traitlets==5.14.3
transformers==4.45.1
triton @ https://github.com/woct0rdho/triton-windows/releases/download/v3.1.0-windows.post5/triton-3.1.0-cp310-cp310-win_amd64.whl
twitchio==2.10.0
types-python-dateutil==2.9.0.20241003
typing_extensions==4.9.0
uri-template==1.3.0
urllib3==2.2.3
Wand==0.6.13
wcwidth==0.2.13
webcolors==24.8.0
webencodings==0.5.1
webp==0.4.0
websocket-client==1.8.0
websockets==13.1
widgetsnbextension==4.0.13
windows-curses==2.3.3
WMI==1.5.1
xformers==0.0.28.post2
yarl==1.18.0
yt-dlp==2024.10.7

C:\Users\Furkan>
@FurkanGozukara FurkanGozukara added bug Something isn't working deepspeed-chat Related to DeepSpeed-Chat labels Dec 14, 2024
@loadams loadams self-assigned this Dec 16, 2024
@loadams loadams added windows Questions or PRs relating to running DeepSpeed on Windows build Improvements to the build and testing systems. and removed deepspeed-chat Related to DeepSpeed-Chat bug Something isn't working labels Dec 16, 2024
@loadams
Contributor

loadams commented Dec 16, 2024

Hi @FurkanGozukara - we are working on getting Windows support added back in, and wheels published (and automated) again soon.

In the meantime, are you able to identify which deepspeed op is causing the nvcc internal error and confirm if that is the only problem you are hitting/confirm that nvcc is able to properly compile other files?
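
One way to narrow this down is sketched below, under the assumption that the build log has the same shape as the excerpt in the original report, where the `.cu` file name is echoed on its own line before the compiler diagnostics. The script reports the last `.cu` file seen before each `fatal error` line. The function name `find_failing_sources` is hypothetical, and in a parallel build the log lines may interleave, so the result is a hint rather than proof. Mapping a source file to a DeepSpeed op still has to be done by hand against the repository.

```python
import re

# A line consisting only of a CUDA source file name, e.g. "slice_attn_masks.cu".
CU_LINE = re.compile(r"^\s*([\w.-]+\.cu)\s*$")
# An MSVC fatal error with its code, e.g. "fatal error C1001".
FATAL = re.compile(r"fatal error (C\d+)")


def find_failing_sources(log_text):
    """Return (source_file, error_code) pairs, one per fatal error in the log."""
    failures = []
    current = None  # most recent .cu file name seen so far
    for line in log_text.splitlines():
        cu = CU_LINE.match(line)
        if cu:
            current = cu.group(1)
            continue
        fatal = FATAL.search(line)
        if fatal:
            failures.append((current, fatal.group(1)))
    return failures


# Excerpt copied from the error in the original report.
sample = r"""cl : Command line warning D9025 : overriding '/D__CUDA_NO_HALF2_OPERATORS__' with '/U__CUDA_NO_HALF2_OPERATORS__'
slice_attn_masks.cu
tmpxft_0000b2e4_00000000-7_slice_attn_masks.compute_90.cudafe1.cpp
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.4\include\cuda_fp16.hpp(992): fatal error C1001: Internal compiler error.
"""

print(find_failing_sources(sample))
```

Run against the full attached log instead of `sample`, this would show whether `slice_attn_masks.cu` is the only file that triggers the internal compiler error.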

@FurkanGozukara
Author

Hi @loadams. I am able to compile some other projects, such as InsightFace, XPose, UniPose, and some 3D-related libraries I tried recently, such as microsoft/TRELLIS.

I really don't know what causes the error here. The entire logs are attached below.

Thank you so much. I hope you set up automated wheel releases for Windows.

entire.logs (1).txt

@FurkanGozukara
Author

Hello again @loadams

This guy is single-handedly (and automatically, I presume) publishing precompiled wheels of pytorch3d for all torch and Python versions.

Perhaps it can help you:

https://github.com/MiroPsota/torch_packages_builder

I used it and it works amazingly well.

Image

@FurkanGozukara
Author

FurkanGozukara commented Jan 14, 2025

Still no precompiled Windows wheels for Python 3.10 published, @loadams?

I have been waiting for months now

Like this one

Image

@loadams
Contributor

loadams commented Jan 14, 2025

Hi @FurkanGozukara - I am still working on this. We have a node we can use for CI once we get the master branch building again, which we hope will be soon. I'll tag any PRs on this issue, and at that point we should be able to publish whls for Python 3.10/3.11 for future releases. We will also need to add additional tests to ensure we don't regress on Windows builds in the future.

@FurkanGozukara
Author

Awesome, thank you so much. I appreciate that.
