8 30 224

Michał Junczyk PRO

michaljunczyk

https://goodmike31.github.io/michaljunczyk/

AI & ML interests

Automatic Speech Recognition, Data Annotation, ML Systems Design, ML Data Management, ML Systems Evaluation

Recent Activity

liked a model about 7 hours ago

pyannote/segmentation-3.0

liked a model about 7 hours ago

pyannote/speaker-diarization-3.1

upvoted a paper 11 days ago

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline

View all activity

Organizations

liked 2 models about 7 hours ago

pyannote/segmentation-3.0

Voice Activity Detection • Updated May 10, 2024 • 10.7M • 899

pyannote/speaker-diarization-3.1

Automatic Speech Recognition • Updated May 10, 2024 • 10.5M • 1.74k

upvoted 2 papers 11 days ago

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline

Paper • 2408.15079 • Published Aug 27, 2024 • 56

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models

Paper • 2603.25750 • Published 25 days ago • 36

liked a Space 14 days ago

Cohere Multilingual ASR

🎙

102

Transcribe audio clips to text in many languages

liked a Space 17 days ago

Voxtral TTS Demo

⚡

191

Generate realistic speech from text with custom or preset voices

upvoted a paper 18 days ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 20 days ago • 62

updated a dataset 28 days ago

michaljunczyk/admedvoice-for-bigos

Viewer • Updated 28 days ago • 26.7k • 87

published a dataset 28 days ago

michaljunczyk/admedvoice-for-bigos

Viewer • Updated 28 days ago • 26.7k • 87

liked a model 28 days ago

utter-project/EuroLLM-22B-Instruct-2512

Text Generation • 23B • Updated Feb 6 • 3.36k • • 63

liked 2 Spaces about 1 month ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

219

Explore synthetic data experiments on a virtual bookshelf

Gradio Chatbot

💬

Chat with an AI assistant using customizable settings

liked 2 models about 1 month ago

microsoft/VibeVoice-ASR

Automatic Speech Recognition • 9B • Updated Jan 27 • 667k • 1.03k

kugelaudio/kugelaudio-0-open

Text-to-Speech • Updated Feb 6 • 5.35k • 183

updated a dataset about 2 months ago

amu-cai/pl-asr-bigos-v2

Updated Feb 18 • 107 • 4

liked 2 datasets about 2 months ago

lion-ai/admedvoice

Viewer • Updated Dec 12, 2025 • 53k • 5 • 2

pipecat-ai/stt-benchmark-data

Viewer • Updated Feb 9 • 1k • 169 • 8