Interview request: genAI evaluation & documentation
#61 opened about 1 year ago
		by
		
				
							
						meggymuggy
	
language dependency
#60 opened over 1 year ago
		by
		
				
							
						Jay369
	
[AUTOMATED] Model Memory Requirements
#59 opened over 1 year ago
		by
		
				
							
						model-sizer-bot
	
Deployments to Azure and Inference Endpoints
#55 opened over 1 year ago
		by
		
				
							
						mo2024
	
Very sensitve to any repetition penalty!
π
							
						1
				#52 opened over 1 year ago
		by
		
				
							
						jukofyork
	
Text2SQL2Output
#51 opened over 1 year ago
		by
		
				
							
						Sudipta179002
	
The generated response cannot stop.
									1
	#50 opened over 1 year ago
		by
		
				
							
						shaohuay
	
Saving dbrx model and tokenizer in dbfs
									5
	#49 opened over 1 year ago
		by
		
				
							
						pro-shep
	
OSError: Unable to load vocabulary from file
									7
	#47 opened over 1 year ago
		by
		
				
							
						khurramnaseem
	
TypeError: __init__() got an unexpected keyword argument 'bias'
									2
	#46 opened over 1 year ago
		by
		
				
							
						dainesn1
	
[DO NOT REVIEW] Mixtral like config
#45 opened over 1 year ago
		by
		
				
							
						Pernekhan
	
Why clamp qkv_states, is it common?
#44 opened over 1 year ago
		by
		
				
							
						jay68
	
Chat template
									9
	#43 opened over 1 year ago
		by
		
				
							
						ehartford
	
GGUF quants?
									1
	#41 opened over 1 year ago
		by
		
				
							
						Iommed
	
Does the tokenizer of this model have a network to load successfully?
									3
	#40 opened over 1 year ago
		by
		
				
							
						Rnake
	
VRAM Requirements?
									8
	#39 opened over 1 year ago
		by
		
				
							
						dounykim
	
How to get hands on experience as a newbie
									1
	#38 opened over 1 year ago
		by
		
				
							
						kimsia
	
Text2sql template and examples
									3
	#34 opened over 1 year ago
		by
		
				
							
						daxiongshu
	
Continuation of the Discussion: More than 10 minutes the status is in Setting `pad_token_id` to `eos_token_id`:100257 for open-end generation. #28
β
							
						2
				
									7
	#31 opened over 1 year ago
		by
		
				
							
						Madhugraj
	
Errors During Training for the Original Implementation and the Fixes for the Errors
π
							
						2
				
									2
	#24 opened over 1 year ago
		by
		
				
							
						v2ray
	
Instruct dataset
π
							
						2
				#23 opened over 1 year ago
		by
		
				
							
						Andriy
	
How to Fine Tune DBRX-Instruct?
									7
	#18 opened over 1 year ago
		by
		
				
							
						elysiia
	
Bug on AMD MI 250 with flash-attention
								3
#13 opened over 1 year ago
		by
		
				
							
						PierreColombo
	
The fused expert parameters means load_in_4bit doesn't work properly, nor does LoRA
π§ 
							π
							
						7
				
								31
#10 opened over 1 year ago
		by
		
				
							
						tdrussell