Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
1343378764516004864.0
TFLOPS
16
18
35
Constantin
Alexandre-Numind
Follow
vishnuo2's profile picture
thomwolf's profile picture
devopen's profile picture
11 followers
·
8 following
https://www.numind.ai/
AI & ML interests
Training AI models @Numind
Recent Activity
new
activity
1 day ago
Qwen/Qwen3.5-35B-A3B:
I would like a recommended training environment setup for the Qwen3.5-MoE model (e.g., Qwen3.5-35B-A3B, model_type: qwen3_5_moe).
upvoted
a
paper
24 days ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
upvoted
a
paper
24 days ago
Kimi K2.5: Visual Agentic Intelligence
View all activity
Organizations
Alexandre-Numind
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
Qwen/Qwen3.5-35B-A3B
1 day ago
I would like a recommended training environment setup for the Qwen3.5-MoE model (e.g., Qwen3.5-35B-A3B, model_type: qwen3_5_moe).
👍
1
1
#16 opened 2 days ago by
444515liuxin
New activity in
numind/NuMarkdown-8B-Thinking
7 months ago
fixing numind.svg width
#1 opened 7 months ago by
NathanFradet
New activity in
numind/NuExtract-tiny
over 1 year ago
Add base_model metadata
#2 opened over 1 year ago by
davanstrien
New activity in
numind/NuExtract
over 1 year ago
How to provide examples in predict_NuExtract
1
#2 opened over 1 year ago by
vijayg
New activity in
numind/NuExtract-large
over 1 year ago
NuExtract-large 7b and NuExtract 3.8B have same size model file
2
#4 opened over 1 year ago by
mohit0928
Error Sample Code
1
#2 opened over 1 year ago by
joon09
New activity in
microsoft/Phi-3-small-8k-instruct
almost 2 years ago
RuntimeError: FlashAttention only support fp16 and bf16 data type during fine tuning.
➕
👍
7
8
#11 opened almost 2 years ago by
faizsameerahmed96
New activity in
numind/NuNER
almost 2 years ago
Change to non commercial license
#3 opened almost 2 years ago by
saattrupdan
New activity in
numind/NuSentiment
almost 2 years ago
Change to non commercial license
#2 opened almost 2 years ago by
saattrupdan
New activity in
numind/NuSentiment
about 2 years ago
[bot] Conversion to Parquet
#1 opened about 2 years ago by
parquet-converter
New activity in
numind/NuNER
about 2 years ago
[bot] Conversion to Parquet
#1 opened about 2 years ago by
parquet-converter
Librarian Bot: Add language metadata for dataset
#2 opened about 2 years ago by
librarian-bot
New activity in
numind/NuSentiment-multilingual
about 2 years ago
Adding `safetensors` variant of this model
#1 opened over 2 years ago by
SFconvertbot
New activity in
numind/NuSentiment-multilingual
over 2 years ago
Adding `safetensors` variant of this model
#2 opened over 2 years ago by
SFconvertbot
New activity in
numind/NuSentiment
over 2 years ago
Adding `safetensors` variant of this model
#1 opened over 2 years ago by
SFconvertbot