ming committed
Commit 87d9e3a · 1 Parent(s): 6e48ad3

Fix TextStreamer batch size error in V2 API

- Add batch size validation to ensure TextIteratorStreamer receives batch size 1
- Handle 1D tensors by adding a batch dimension
- Handle oversized batches by taking the first sample only
- Maintain compatibility with all model types (T5, BART, etc.)
- Fix 'TextStreamer only supports batch size 1' error
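The normalization described above can be sketched as a standalone function, assuming PyTorch tensors as produced by a Hugging Face tokenizer. The function name `ensure_batch_size_one` is hypothetical and introduced here for illustration, not part of the repository:

```python
# Minimal sketch of the batch-size normalization this commit adds.
# Assumption: `inputs` maps strings to torch.Tensor, as returned by a
# Hugging Face tokenizer with return_tensors="pt".
import torch

def ensure_batch_size_one(inputs: dict) -> dict:
    """Force every tensor in `inputs` to a leading batch dimension of 1,
    since TextIteratorStreamer rejects batch sizes greater than 1."""
    for key, tensor in inputs.items():
        if tensor.dim() > 1 and tensor.size(0) > 1:
            # Oversized batch: keep only the first sample.
            inputs[key] = tensor[:1]
        elif tensor.dim() == 1:
            # 1D tensor: add the missing batch dimension.
            inputs[key] = tensor.unsqueeze(0)
    return inputs

# Example: a 1D input_ids tensor gains a batch dimension, and a
# batch of 3 attention masks is truncated to a single sample.
batch = {
    "input_ids": torch.tensor([101, 2023, 102]),
    "attention_mask": torch.ones(3, 3, dtype=torch.long),
}
batch = ensure_batch_size_one(batch)
print(batch["input_ids"].shape)       # torch.Size([1, 3])
print(batch["attention_mask"].shape)  # torch.Size([1, 3])
```

Note that truncating to the first sample silently drops the rest of the batch; callers that need to stream multiple inputs would have to loop over samples instead.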

app/services/hf_streaming_summarizer.py CHANGED
@@ -213,6 +213,17 @@ class HFStreamingSummarizer:
 
         inputs = inputs.to(self.model.device)
 
+        # CRITICAL FIX: Ensure batch size is 1 for TextIteratorStreamer
+        # The streamer only works with batch size 1, so we need to ensure
+        # that all input tensors have batch dimension of 1
+        for key, tensor in inputs.items():
+            if tensor.dim() > 1 and tensor.size(0) > 1:
+                # If batch size > 1, take only the first sample
+                inputs[key] = tensor[:1]
+            elif tensor.dim() == 1:
+                # If tensor is 1D, add batch dimension
+                inputs[key] = tensor.unsqueeze(0)
+
         # Create streamer for token-by-token output
         streamer = TextIteratorStreamer(
             self.tokenizer,