Falcon 40B Inference at 4bit in Google Colab
pinned๐ค
							๐
							
						27
				
								27
#38 opened over 2 years ago
		by
		
				
 serin32
							
						serin32
	
Custom 4-bit Finetuning 5-7 times faster inference than QLora
pinnedโค๏ธ
							๐
							
						7
				
								6
#25 opened over 2 years ago
		by
		
				
 rmihaylov
							
						rmihaylov
	
remove-extra-parentheses
#115 opened over 1 year ago
		by
		
				
 ZennyKenny
							
						ZennyKenny
	
 
							Could not locate the configuration_RW.py inside tiiuae/falcon-40b-instruct.
#114 opened over 1 year ago
		by
		
				
 cosmino
							
						cosmino
	
 
							[AUTOMATED] Model Memory Requirements
#113 opened over 1 year ago
		by
		
				
 model-sizer-bot
							
						model-sizer-bot
	
Adding Evaluation Results
#111 opened over 1 year ago
		by
		
				
 leaderboard-pr-bot
							
						leaderboard-pr-bot
	
 
							Could someone upload a tokenizer.model file? to allow for making ggufs
#110 opened almost 2 years ago
		by
		
				
 RonanMcGovern
							
						RonanMcGovern
	
 
							Add chat_template so that it can be used for chat out-of-box
#109 opened almost 2 years ago
		by
		
				
 chujiezheng
							
						chujiezheng
	
 
							pb when testing the model
#108 opened about 2 years ago
		by
		
				
 louvivien
							
						louvivien
	
Update generation_config.json
								1
#106 opened about 2 years ago
		by
		
				
 nkasmanoff
							
						nkasmanoff
	
 
							Gradio interface
#105 opened about 2 years ago
		by
		
				
 sequentialsystems
							
						sequentialsystems
	
 
							Optimizing Inference Time for Chat Conversations on Falcon
๐
							
						1
				
								2
#104 opened about 2 years ago
		by
		
				
 humza-sami
							
						humza-sami
	
 
							Finetuned Falcon40 is not working with pipeline (text-generation)
#103 opened about 2 years ago
		by
		
				
 chelouche9
							
						chelouche9
	
 
							Advice on inference over a large-ish dataset in Databricks?
#102 opened about 2 years ago
		by
		
				
 archonlith
							
						archonlith
	
 
							Use input attention mask instead of casual mask in attention
#101 opened about 2 years ago
		by
		
				
 CyberZHG
							
						CyberZHG
	
Inference
								4
#99 opened over 2 years ago
		by
		
				
 davidhung
							
						davidhung
	
Best Practice for Handling Variable-Length Sequences in Training an LLM Model on a Chatbot Dataset
#98 opened over 2 years ago
		by
		
				
 humza-sami
							
						humza-sami
	
 
							Request: DOI
#97 opened over 2 years ago
		by
		
				
 waelTalan
							
						waelTalan
	
Getting HTTP Error Code: 422 when using Inference API
								2
#96 opened over 2 years ago
		by
		
				
 reetkat
							
						reetkat
	
Run falcon on Mac
								2
#95 opened over 2 years ago
		by
		
				
 corin9122
							
						corin9122
	
Unable to use all cores.
								2
#94 opened over 2 years ago
		by
		
				
 armx40
							
						armx40
	
 
							Bug: the model's head dimensionality is hardcoded
#93 opened over 2 years ago
		by
		deleted
Fine-tune on model response only?
๐
							
						1
				
								1
#92 opened over 2 years ago
		by
		
				
 mkserge
							
						mkserge
	
Finetuning Base Falcon on Unseen Language/New data (non instruct/RLHF)
								2
#91 opened over 2 years ago
		by
		
				
 AshBam
							
						AshBam
	
Slow response time for 7b and 40b
								6
#89 opened over 2 years ago
		by
		
				
 kartik99
							
						kartik99
	
configuration_RW.py Missing in the latest commit
#88 opened over 2 years ago
		by
		
				
 ravikiran3690
							
						ravikiran3690
	
Update README.md
								2
#87 opened over 2 years ago
		by
		
				
 FelixMildon
							
						FelixMildon
	
Falcon breaks after the second prompt of code.
#86 opened over 2 years ago
		by
		
				
 thecowmilk
							
						thecowmilk
	
Changes in modelling_RW.py to be able to handle past_key_values for faster model generations
								8
#85 opened over 2 years ago
		by
		
				
 puru22
							
						puru22
	
@TII Falcon is stunning but will you continue or is the majestic bird destined to starve ?
#84 opened over 2 years ago
		by
		
				
 cmp-nct
							
						cmp-nct
	
 
							Finetune Error using the notebook referred on the model page
#83 opened over 2 years ago
		by
		
				
 hamad
							
						hamad
	
Nvidia H100 Finetuning Error on BitsandBytes
								2
#82 opened over 2 years ago
		by
		
				
 ashmitbhattarai
							
						ashmitbhattarai
	
new here, confused which .bin file to download?
#80 opened over 2 years ago
		by
		
				
 kingofdelphi
							
						kingofdelphi
	
Update generation_config.json
#77 opened over 2 years ago
		by
		
				
 psinger
							
						psinger
	
 
							Request: DOI
#76 opened over 2 years ago
		by
		
				
 winter6below618
							
						winter6below618
	
Seeking insights on integrating RAG with Falcon for Domain Specific requirements
#75 opened over 2 years ago
		by
		
				
 rahul2008d
							
						rahul2008d
	
 
							Prevent Hallucinations
								1
#74 opened over 2 years ago
		by
		
				
 Zhaoqiong
							
						Zhaoqiong
	
Deployment on Azure ML
								1
#73 opened over 2 years ago
		by
		
				
 Eliahu551818
							
						Eliahu551818
	
 
							Access To Hidden States
#72 opened over 2 years ago
		by
		
				
 DJT777
							
						DJT777
	
Were special tokens trained?
#71 opened over 2 years ago
		by
		
				
 Tron2060
							
						Tron2060
	
Example code from README output is nonsense
								1
#70 opened over 2 years ago
		by
		
				
 amitgurintecom
							
						amitgurintecom
	
New language
								2
#69 opened over 2 years ago
		by
		
				
 mindplay
							
						mindplay
	
GPU requirements
๐
							
						6
				
								7
#68 opened over 2 years ago
		by
		
				
 GuySerk
							
						GuySerk
	
Cuda out of memory error.
								2
#67 opened over 2 years ago
		by
		
				
 ibrim
							
						ibrim
	
ValueError: The following model_kwargs are not used by the model: ['token_type_ids'] (note: typos in the generate arguments will also show up in this list)
								1
#66 opened over 2 years ago
		by
		
				
 yiz4869
							
						yiz4869
	
How to fine tune falcon for summarization on xsum?
								1
#65 opened over 2 years ago
		by
		
				
 uzumakiusa
							
						uzumakiusa
	
Need claritiy about the adjustable model hyperparameters
#64 opened over 2 years ago
		by
		
				
 Someshfengde
							
						Someshfengde
	
Update README.md
#63 opened over 2 years ago
		by
		
				
 Gage888
							
						Gage888
	
Borken docs link Use in transformers
								1
#62 opened over 2 years ago
		by
		
				
 natika1
							
						natika1
	
 
							Hello, may I know where can I get the embeddings for falcon-40b?
๐
							
						4
				
								3
#61 opened over 2 years ago
		by
		
				
 kurtgan
							
						kurtgan
	
