Qwen3-TTS for Swiss German: Dialect Adaptation via Talker Layer Fine-Tuning
How we adapted Qwen3-TTS to Swiss German by training only the last 8 talker layers and creating a dedicated speaker per dialect — with audio examples from ou...
Field notes, thoughts, experiments, and shipping lessons from production ASR and NLP work and other cool stuff at work.
How we adapted Qwen3-TTS to Swiss German by training only the last 8 talker layers and creating a dedicated speaker per dialect — with audio examples from ou...
From Telegram pings to automated calorie tracking and agentic coding-how OpenClaw changed my relationship with AI.
Exploring the counter-intuitive relationship between training loss and Word Error Rate (WER) during Whisper fine-tuning.
How decoder cross-attention weights can be used to create high-precision token-to-time alignments without external models.
Solving the Overcooked Game with PPO.
Competing at the top of the Numerai tournament with live-deployed ML models
Presentation about Fine-tuning Whisper for Swiss-German at SwissText 2025
Science Publication: We segment and extract transient oxygen ‘dips’ from bioluminescence recordings using semi-NMF.