ming committed
Commit 55d10ea
1 Parent(s): 734e281
Add OMP_NUM_THREADS env var and improve Outlines requirement comment
- Set OMP_NUM_THREADS=1 to fix libgomp warning
- Add comment about Python 3.10+ requirement for Outlines
- Dockerfile +2 -1
- requirements.txt +1 -1
Dockerfile CHANGED
@@ -14,7 +14,8 @@ ENV PYTHONDONTWRITEBYTECODE=1 \
 V4_ENABLE_QUANTIZATION=true \
 V4_USE_FP16_FOR_SPEED=true \
 HF_HOME=/tmp/huggingface \
-TRANSFORMERS_NO_TORCHAO=1
+TRANSFORMERS_NO_TORCHAO=1 \
+OMP_NUM_THREADS=1
 
 # Set work directory
 WORKDIR /app
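The Dockerfile change only covers processes started inside the container. For runs outside it (local development, tests), the same default could be applied from Python, roughly as sketched below. The helper name `configure_omp_threads` is hypothetical and not part of this commit, and the sketch assumes the OpenMP runtime reads the variable when the native library loads, so the call has to come before importing libraries such as torch or numpy.

```python
import os


def configure_omp_threads(default: str = "1") -> str:
    """Return the effective OMP_NUM_THREADS, setting `default` if it is unset.

    Hypothetical helper for illustration; not part of this commit.
    """
    # setdefault leaves an existing value (for example the Dockerfile ENV) untouched.
    return os.environ.setdefault("OMP_NUM_THREADS", default)


# Call this before importing OpenMP-backed libraries so the runtime sees the value.
effective_threads = configure_omp_threads()
```

Using `setdefault` rather than plain assignment means a value set by the Dockerfile or by the deployment environment still takes precedence over the in-code default.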
requirements.txt CHANGED
@@ -20,7 +20,7 @@ accelerate>=0.33.0,<1.0.0 # Required for GPU quantization (V4)
 bitsandbytes>=0.44.0 # 4-bit NF4 quantization for GPU (V4)
 einops>=0.6.0,<1.0.0 # Required for model architecture (V4)
 scipy>=1.10.0,<2.0.0 # Often needed for unquantized models (V4)
-outlines>=0.0.34 # JSON schema enforcement for V4 structured summarization
+outlines>=0.0.34 # JSON schema enforcement for V4 structured summarization (requires Python 3.10+)
 
 # Testing
 pytest>=7.0.0,<8.0.0
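The new comment documents the Python 3.10+ requirement but does not enforce it. A minimal sketch of a startup guard follows. The 3.10 minimum is taken from the requirements.txt comment above, and the names `MIN_OUTLINES_PYTHON` and `outlines_supported` are hypothetical and do not appear in the repository.

```python
import sys

# Minimum interpreter version, as stated in the requirements.txt comment.
MIN_OUTLINES_PYTHON = (3, 10)


def outlines_supported(version_info=sys.version_info) -> bool:
    """Return True if the given interpreter version meets the documented minimum.

    Hypothetical helper for illustration; not part of this commit.
    """
    # Compare only (major, minor); patch level does not matter here.
    return tuple(version_info[:2]) >= MIN_OUTLINES_PYTHON


if not outlines_supported():
    # Warn instead of raising so code paths that do not need Outlines keep working.
    print(
        f"Python {sys.version_info.major}.{sys.version_info.minor} detected; "
        "Outlines-based structured summarization needs 3.10 or newer.",
        file=sys.stderr,
    )
```

Whether to warn or to fail fast depends on how essential structured summarization is to the service. The sketch warns so that endpoints that do not depend on Outlines stay usable.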