AI & ML interests

None defined yet.

Recent Activity

FlameF0X updated a Space 2 days ago
Auto-PreTrain/APT-product
FlameF0X updated a model 3 days ago
Auto-PreTrain/APT-GPT
FlameF0X published a model 3 days ago
Auto-PreTrain/APT-GPT

FlameF0X posted an update 5 months ago
I am very sad to say that the budget for creating the SnowflakeCore-G1 1B and 7B MoE models has run out, and I can't pre-train them anymore.
FlameF0X posted an update 6 months ago
Training for SnowflakeCore-G1-1B and 7B will be restarted, because I have now implemented DeepSpeed and managed to use two GPUs.
FlameF0X posted an update 6 months ago
The development of SnowflakeCore-G1-7B-MoE is getting delayed. In the meantime, I am working on SnowflakeCore-G1-1B-MoE, which will be a pre-trained chatbot.
FlameF0X posted an update 6 months ago
An update on the development of SnowflakeCore-G1-7B-MoE: I can't say yet when it will be published, because it's big and requires a lot of computational power.
FlameF0X posted an update 6 months ago
FlameF0X posted an update 6 months ago
Hello! Important announcement: I will rename SnowflakeCore-G1-Medium to SnowflakeCore-G1-Tiny2, because it will have the same parameter count as the Tiny version but is trained on more data.
FlameF0X posted an update 6 months ago
Currently working on SnowflakeCore-G1-Medium. [Updated loss curve]
FlameF0X posted an update 6 months ago
FlameF0X posted an update 6 months ago
FlameF0X posted an update 7 months ago
FlameF0X posted an update 7 months ago
SnowflakeCore-G1 Update:
Got it running and training! Context window is currently set to 2048 tokens.
Training is active and stable. Will share results once I have some metrics to report.
FlameF0X posted an update 7 months ago
SnowflakeCore-G1 development update: We're building a 24-layer transformer with a 32K context and 1024-dimensional embeddings, which is pretty ambitious! Even running at batch_size=1 with heavy gradient accumulation, we're hitting memory walls at 300 GB of RAM. Scaling up to ~1 TB will take some time, but the architecture is looking promising. Thanks for following along with the journey! 😅
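As a rough illustration of why a 32K context is so memory-hungry, the sketch below estimates the memory needed just to hold the attention score matrices when they are fully materialized and kept for the backward pass. The head count (16), fp32 storage, 32K meaning 32 * 1024 tokens, and naive (non-fused) attention are all assumptions that the post does not state, and `attention_scores_bytes` is a hypothetical helper, so treat this as an order-of-magnitude guide rather than a description of the actual training setup.

```python
GIB = 1024 ** 3


def attention_scores_bytes(seq_len, num_heads, num_layers,
                           batch_size=1, bytes_per_element=4):
    """Bytes needed for all (seq_len x seq_len) attention score matrices.

    Assumes a naive implementation that stores one full score matrix
    per head and per layer for the backward pass.
    """
    per_head = seq_len * seq_len * bytes_per_element
    return batch_size * num_layers * num_heads * per_head


# 24 layers and a 32K context come from the post;
# num_heads=16 and fp32 (4 bytes per element) are assumptions.
total = attention_scores_bytes(seq_len=32 * 1024, num_heads=16, num_layers=24)
print(f"{total / GIB:.0f} GiB")  # prints "1536 GiB"
```

Under these assumptions the score matrices alone come to 1536 GiB at a 32K context, compared with 6 GiB at 2048 tokens, because the cost grows with the square of the sequence length. Memory-efficient attention kernels avoid materializing these matrices, which is one common way to reduce this cost.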
FlameF0X posted an update 7 months ago
Hello there!
I just found out that all the SnowflakeCore-G0 series models are masked language models rather than text-generating LLMs.
The development of SnowflakeCore-G0-Release-3 will be delayed even more.

Edit: I am officially ending the development of SnowflakeCore-G0 and starting the development of SnowflakeCore-G1, which SHOULD be a text generator.

Edit-2: After some evaluation of the code, the models are actually text generators, so the development of G0 will continue.
FlameF0X posted an update 7 months ago
Hi everyone!
The release of https://huggingface.co/FlameF0X/SnowflakeCore-G0-Release-3-1B is currently delayed due to hardware limitations: I currently lack the compute resources needed to complete training. I'm exploring options and will keep you updated on any progress.
Thank you for your patience and support!
FlameF0X posted an update 7 months ago
FlameF0X posted an update 7 months ago