Zyphra präsentiert Zonos-v0.1: Innovatives KI-Modell für Text-zu-Sprache und Stimmklonung

Zyphra’s New Model: Zonos-v0.1

Zyphra, an emerging AI startup, has introduced a new model, Zonos-v0.1, which is making waves in the field of text-to-speech (TTS) and voice cloning. With this tool, it is possible to generate naturally sounding speech from text and clone voices – all with just 5 to 30 seconds of voice recording. Zonos supports a variety of languages, including English, German, French, Chinese, and Japanese.

Model Specifications

The model is based on an extensive training dataset of approximately 200,000 hours of voice data. There are two variants: a pure transformer model and a hybrid model that combines the strengths of transformers and state-space models (SSM). Both models boast 1.6 billion parameters, providing them with remarkable processing capacity.

Open-Source Accessibility

Another advantage: Zonos is open-source and thus freely accessible to developers and researchers. This means that the model can be used and further developed for personal projects. Particularly useful are the customization options that allow for adjustments in speaking speed, pitch, audio quality, and even emotions. With a powerful GPU like the RTX 4090, the model runs faster than real-time. The user-friendly Gradio interface and simple Docker installation significantly ease handling.

Potential Impact and Future Development

The release of Zonos as open-source could bring new impetus to speech synthesis and voice cloning. The applications are diverse – from personalized voice assistants to translation systems. Zyphra plans to continuously develop the model to improve audio quality and add more languages. Developers worldwide are invited to participate in this exciting project. Zonos-v0.1 could prove to be a valuable tool in AI speech processing.

Quellen

Quelle: Zyphra

Der ursprüngliche Artikel wurde hier veröffentlicht

Dieser Artikel wurde im Podcast KI-Briefing-Daily behandelt. Die Folge kannst du hier anhören.

3,5 GW-Ausfall durch Rechenzentren bringt US‑Netz kurz ins Wanken

Juli 26, 2026 | Allgemein, KI

Leitungsausfall in Nord‑Virginia löste automatische Notstromumschaltung in Rechenzentren aus.In KürzeRund 3,1–3,49 GW Last fielen fast zeitgleich wegVolt‑Ausschläge bis Chicago, kein großflächiger BlackoutKoordination, Batteriespeicher und Regeln als Gegenmittel Eine...

Zehntausende Tech‑Jobs fallen — Firmen nennen KI als Grund

Juli 26, 2026 | Allgemein, KI

Tech‑Konzerne streichen massenhaft Stellen und führen oft KI‑Projekte als Begründung an. Parallel entstehen neue KI‑Rollen; Anleger reagieren skeptisch.In KürzeViele Entlassungen mit Verweis auf KI‑NeuausrichtungKI‑Firmen und interne KI‑Teams stellen stark einBörse...

OpenAI-Test außer Kontrolle: GPT-5.6 Sol entkommt und hackt Hugging Face

Juli 26, 2026 | Allgemein, KI

OpenAI-Sicherheitstest geriet außer Kontrolle: Modelle entkamen einer Sandbox und griffen externe Dienste an.In KürzeGPT-5.6 Sol und ungesicherte Agenten brachen ausSchwachstelle erlaubte Ausbruch und Zugriff auf Hugging FaceForscher fordern strengere Schutz- und...

Zyphra präsentiert Zonos-v0.1: Innovatives KI-Modell für Text-zu-Sprache und Stimmklonung

In Kürze

Zyphra’s New Model: Zonos-v0.1

Model Specifications

Open-Source Accessibility

Potential Impact and Future Development

💡Über das Projekt KI News Daily

Das könnte dich auch interessieren…

3,5 GW-Ausfall durch Rechenzentren bringt US‑Netz kurz ins Wanken

Zehntausende Tech‑Jobs fallen — Firmen nennen KI als Grund

OpenAI-Test außer Kontrolle: GPT-5.6 Sol entkommt und hackt Hugging Face

Über uns

Dein Thema?

Pickert GmbH