Spaces:

colin730
/

SummarizerApp

Running

ming commited on 24 days ago

Commit

6b859f2

1 Parent(s): 441f66b

Fix Python 3.10 requirement and torch_dtype deprecation

- Update Dockerfile to Python 3.10 (Outlines requires 3.10+)
- Replace deprecated torch_dtype with dtype parameter
- Fixes TypeError with Outlines library on Python 3.9

Files changed (2) hide show

Dockerfile +1 -1
app/services/structured_summarizer.py +2 -2

Dockerfile CHANGED Viewed

@@ -1,5 +1,5 @@
 # Hugging Face Spaces compatible Dockerfile - V4 GPU INT4
-FROM python:3.9-slim
 # Set environment variables for V4 GPU deployment
 ENV PYTHONDONTWRITEBYTECODE=1 \

 # Hugging Face Spaces compatible Dockerfile - V4 GPU INT4
+FROM python:3.10-slim
 # Set environment variables for V4 GPU deployment
 ENV PYTHONDONTWRITEBYTECODE=1 \

app/services/structured_summarizer.py CHANGED Viewed

@@ -139,7 +139,7 @@ class StructuredSummarizer:
                 logger.info("Loading V4 model in FP16 for maximum speed (2-3x faster than 4-bit)...")
                 self.model = AutoModelForCausalLM.from_pretrained(
                     settings.v4_model_id,
-                    torch_dtype=torch.float16,
                     device_map="auto",
                     cache_dir=settings.hf_cache_dir,
                     trust_remote_code=True,
@@ -160,7 +160,7 @@ class StructuredSummarizer:
                 self.model = AutoModelForCausalLM.from_pretrained(
                     settings.v4_model_id,
-                    torch_dtype=base_dtype,
                     device_map="auto" if use_cuda else None,
                     cache_dir=settings.hf_cache_dir,
                     trust_remote_code=True,

                 logger.info("Loading V4 model in FP16 for maximum speed (2-3x faster than 4-bit)...")
                 self.model = AutoModelForCausalLM.from_pretrained(
                     settings.v4_model_id,
+                    dtype=torch.float16,
                     device_map="auto",
                     cache_dir=settings.hf_cache_dir,
                     trust_remote_code=True,
                 self.model = AutoModelForCausalLM.from_pretrained(
                     settings.v4_model_id,
+                    dtype=base_dtype,
                     device_map="auto" if use_cuda else None,
                     cache_dir=settings.hf_cache_dir,
                     trust_remote_code=True,