Neurlang Project

neurlang

28 1 75

https://blog.neurlang.online

neurlang

AI & ML interests

hashtrons/weightless (non neural) networks

Recent Activity

updated a model 2 days ago

neurlang/en-whipstr-base-48khz-libritts-r

reacted to ginigen-ai's post with 🤯 3 days ago

🍳 The RoboCasa Kitchen Leaderboard What does it take for a robot to handle kitchen chores the way a person does? It has to see (Vision), understand instructions (Language), and actually act (Action) — and VLA (Vision-Language-Action) models are emerging as the answer. They're the bridge between large multimodal models and real-world embodied control. RoboCasa Kitchen is a leading robot-learning benchmark in which a single-arm robot (Franka Panda) performs 24 atomic manipulation tasks — picking up cups and bowls, opening drawers and doors, turning faucets, pressing buttons, and more — inside a photorealistic simulated kitchen. Because the layout and object placement are randomized every episode, it tests genuine generalization rather than memorized motions. The score (success rate, SR) is the average fraction of the 24 tasks completed as instructed, measured over multiple seeds so results aren't down to luck. The catch: this benchmark has no official leaderboard, and protocols (number of demonstrations, evaluation setup) differ from paper to paper, leaving scores scattered. Lining the numbers up naively quickly turns into an apples-to-oranges comparison. This leaderboard fixes that by collecting published scores with their sources and comparing only what is genuinely comparable. It's split into three tables: 🏆 Kitchen 24-task (matched) — head-to-head under identical conditions (per the RLDX-1 Technical Report). This is the core ranking you can actually trust. ➕ Other protocols — self-reported under different setups (e.g. fewer demos). Not directly comparable, so kept separate. 🤖 GR1-Tabletop — a different, humanoid-based variant suite, separated to avoid confusion. Any researcher can submit their own model's score directly, and submissions are reviewed before they appear on the board. Every number links to its source paper, so you can verify it yourself. 👉 https://huggingface.co/spaces/ginigen-ai/robocasa-kitchen-leaderboard

reacted to pankajpandey-dev's post with 🚀 7 days ago

🇮🇳 New in my Hindi LLM Series: Gemma-4 E4B, fine-tuned for Hindi — and it runs on your laptop's CPU. I fine-tuned Google's new Gemma-4 E4B on ~10k Hindi instruction pairs (AI4Bharat: anudesh + dolly) using Unsloth + LoRA, on a single L4 GPU. Then I ran an honest side-by-side eval: base Gemma-4 vs my fine-tune, across 25 Hindi prompts. The results were interesting 👇 ✅ My fine-tune is more concise — ask for "3 tips" and it gives exactly 3. Base writes a 1,200-character essay. ✅ Pure native Hindi — base keeps slipping into English ("संतुलित आहार (Eat a Balanced Diet)", "तारा (Star)"). My fine-tune stays in clean Hindi. ✅ Tighter instruction-following — ask for a "short message" and it gives one, not a menu of options. ⚖️ And to be honest: base Gemma-4 is more detailed and comprehensive. I didn't build a "smarter" model — I built a focused, Hindi-native, edge-friendly one that runs as a 5GB GGUF (Q4) on CPU. 🔗 Try it: Live demo (CPU): https://huggingface.co/spaces/pankajpandey-dev/gemma-4-e4b-hindi-demo GGUF (Ollama/llama.cpp): https://huggingface.co/pankajpandey-dev/gemma-4-e4b-hindi-instruct-GGUF 16-bit model: https://huggingface.co/pankajpandey-dev/gemma-4-e4b-hindi-instruct Built with @unsloth · Data by @ai4bharat 🙏 #Hindi #LLM #Gemma #Unsloth #IndicNLP #GGUF

View all activity

Organizations

None yet

updated a model 2 days ago

neurlang/en-whipstr-base-48khz-libritts-r

Automatic Speech Recognition • 17.2M • Updated 2 days ago • 4 • 3

reacted to ginigen-ai's post with 🤯 3 days ago

Post

5175

🍳 The RoboCasa Kitchen Leaderboard
What does it take for a robot to handle kitchen chores the way a person does? It has to see (Vision), understand instructions (Language), and actually act (Action) — and VLA (Vision-Language-Action) models are emerging as the answer. They're the bridge between large multimodal models and real-world embodied control.

RoboCasa Kitchen is a leading robot-learning benchmark in which a single-arm robot (Franka Panda) performs 24 atomic manipulation tasks — picking up cups and bowls, opening drawers and doors, turning faucets, pressing buttons, and more — inside a photorealistic simulated kitchen. Because the layout and object placement are randomized every episode, it tests genuine generalization rather than memorized motions. The score (success rate, SR) is the average fraction of the 24 tasks completed as instructed, measured over multiple seeds so results aren't down to luck.

The catch: this benchmark has no official leaderboard, and protocols (number of demonstrations, evaluation setup) differ from paper to paper, leaving scores scattered. Lining the numbers up naively quickly turns into an apples-to-oranges comparison.

This leaderboard fixes that by collecting published scores with their sources and comparing only what is genuinely comparable. It's split into three tables:

🏆 Kitchen 24-task (matched) — head-to-head under identical conditions (per the RLDX-1 Technical Report). This is the core ranking you can actually trust.
➕ Other protocols — self-reported under different setups (e.g. fewer demos). Not directly comparable, so kept separate.
🤖 GR1-Tabletop — a different, humanoid-based variant suite, separated to avoid confusion.

Any researcher can submit their own model's score directly, and submissions are reviewed before they appear on the board. Every number links to its source paper, so you can verify it yourself.

👉 ginigen-ai/robocasa-kitchen-leaderboard

reacted to pankajpandey-dev's post with 🚀 7 days ago

Post

7808

🇮🇳 New in my Hindi LLM Series: Gemma-4 E4B, fine-tuned for Hindi — and it runs on your laptop's CPU.
I fine-tuned Google's new Gemma-4 E4B on ~10k Hindi instruction pairs (AI4Bharat: anudesh + dolly) using Unsloth + LoRA, on a single L4 GPU.
Then I ran an honest side-by-side eval: base Gemma-4 vs my fine-tune, across 25 Hindi prompts. The results were interesting 👇
✅ My fine-tune is more concise — ask for "3 tips" and it gives exactly 3. Base writes a 1,200-character essay.

✅ Pure native Hindi — base keeps slipping into English ("संतुलित आहार (Eat a Balanced Diet)", "तारा (Star)"). My fine-tune stays in clean Hindi.

✅ Tighter instruction-following — ask for a "short message" and it gives one, not a menu of options.
⚖️ And to be honest: base Gemma-4 is more detailed and comprehensive. I didn't build a "smarter" model — I built a focused, Hindi-native, edge-friendly one that runs as a 5GB GGUF (Q4) on CPU.
🔗 Try it:

Live demo (CPU): pankajpandey-dev/gemma-4-e4b-hindi-demo
GGUF (Ollama/llama.cpp): pankajpandey-dev/gemma-4-e4b-hindi-instruct-GGUF
16-bit model: pankajpandey-dev/gemma-4-e4b-hindi-instruct

Built with @unsloth · Data by @ai4bharat 🙏
#Hindi #LLM #Gemma #Unsloth #IndicNLP #GGUF

12 replies

updated a model 10 days ago

neurlang/piper-onnx-slovakspeech-female-slovak-0.8.0

Updated 10 days ago

published a model 10 days ago

neurlang/piper-onnx-slovakspeech-female-slovak-0.8.0

Updated 10 days ago

liked a model 15 days ago

owensong/Inflect-Nano-v1

Text-to-Speech • Updated 11 days ago • 214

New activity in neurlang/low-quality-multilingual-sentences about 1 month ago

[bot] Conversion to Parquet

#1 opened 3 months ago by

parquet-converter

reacted to kavyamanohar's post with 🔥 about 1 month ago

Post

4598

Releasing Vividh-ASR — an open benchmark and models for Hindi and Malayalam ASR.

Vividh-ASR is built from public data, stratified by complexity:
→ Clean recordings
→ Noisy and accented speech
→ Spontaneous, conversational audio

Alongside the benchmark, we release:
→ Open models for Hindi and Malayalam
→ A training recipe with two counterintuitive choices that moved the needle
→ What failed, not just what worked

The stratified evaluation methodology transfers directly to any low-resource language setup — beyond Hindi and Malayalam.

Built at @adalatai , where we build speech tech for Indian courts. This is our first open contribution back to the community. @janaab @Kush0610 @orgh0

Link: https://huggingface.co/blog/adalat-ai/vividh-benchmark

liked a dataset about 2 months ago

neurlang/slovakspeech_male_dataset

Updated May 17 • 13 • 2

updated a dataset about 2 months ago

neurlang/slovakspeech_male_dataset

Updated May 17 • 13 • 2

reacted to cesear64's post with 🔥 about 2 months ago

Post

4132

Just published: how we built production Sango (Central African Republic) translation without fine-tuning, parallel corpus, or training compute.

The method — vocabulary-augmented prompting with a 581-entry native-speaker-verified lexicon — generalizes to any of the ~2,000 African languages at the same data-poverty level. Recipe, dataset, and code template all included.

📄 Blog: https://huggingface.co/blog/MEYNG/sangoai
📦 Dataset: MEYNG/sango-vocabulary

Would especially value feedback from anyone working on other low-resource African languages — Ewondo, Lingala, Wolof next on our roadmap.

2 replies

liked a model about 2 months ago

Godelaune/Kokoro-82M-ONNX-German-Martin

Text-to-Speech • Updated May 22 • 17

reacted to unmodeled-tyler's post with 😎 about 2 months ago

Post

4126

Hey Hugging Face!

Repo: https://github.com/unmodeled-tyler/vessel-browser

I wanted to share a cool feature from my open source AI native web browser, Vessel: Persistent highlights!

You can highlight anything on the page and the context is provided to the agent. It's kind of a fun way to learn about new stuff, synthesize info, or just deepen your comprehension/understanding.

Since highlights are persistent, you can close the page, come back later - and your highlights will be exactly where you left them. I've found this particularly useful when reviewing technical blogs, model cards, etc.

Check it out!