nithinraok
		
	commited on
		
		
					Commit 
							
							Β·
						
						a0b7f5b
	
1
								Parent(s):
							
							652da3a
								
add v3
Browse filesSigned-off-by: nithinraok <[email protected]>
    	
        README.md
    CHANGED
    
    | @@ -265,6 +265,9 @@ img { | |
| 265 | 
             
            }
         | 
| 266 | 
             
            </style>
         | 
| 267 |  | 
|  | |
|  | |
|  | |
| 268 | 
             
            ## Description:
         | 
| 269 | 
             
            NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieve state-of-the-art performance on multiple speech benchmarks. With 883 million parameters and an inference speed of more than 1000 RTFx (on open-asr-leaderboard datasets), canary-1b-flash supports automatic speech-to-text recognition (ASR) in four languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC). Additionally, canary-1b-flash offers an experimental feature for word-level and segment-level timestamps in English, German, French, and Spanish.
         | 
| 270 | 
             
            This model is released under the permissive CC-BY-4.0 license and is available for commercial use.
         | 
|  | |
| 265 | 
             
            }
         | 
| 266 | 
             
            </style>
         | 
| 267 |  | 
| 268 | 
            +
            > **π NEW: Canary 1B V2 is now available!**  
         | 
| 269 | 
            +
            > π **25 European Languages** | β±οΈ **Much Improved Timestamp Prediction** | π **Enhanced ASR & AST** | π **[Try it here: nvidia/canary-1b-v2](https://huggingface.co/nvidia/canary-1b-v2)**
         | 
| 270 | 
            +
             | 
| 271 | 
             
            ## Description:
         | 
| 272 | 
             
            NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieve state-of-the-art performance on multiple speech benchmarks. With 883 million parameters and an inference speed of more than 1000 RTFx (on open-asr-leaderboard datasets), canary-1b-flash supports automatic speech-to-text recognition (ASR) in four languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC). Additionally, canary-1b-flash offers an experimental feature for word-level and segment-level timestamps in English, German, French, and Spanish.
         | 
| 273 | 
             
            This model is released under the permissive CC-BY-4.0 license and is available for commercial use.
         | 
