VintageVoice

Proof of Antiquity for AI voices — dead accents preserved on vintage hardware compute.

The first open-source TTS model for historical speech patterns. Fine-tuning F5-TTS on 164 hours of public domain vintage audio (1888–1955) to preserve extinct and endangered accents.

GitHub Cajun French
164Hours of training data
44,345Audio segments
10Voice presets
$138Total project cost

The Problem

Modern TTS models sound modern. No AI can speak with a transatlantic accent, a 1940s newsreel cadence, or the speech patterns of someone born before the Civil War. These voices are disappearing from living memory.

Additionally, Cajun French — a UNESCO “severely endangered” language — has approximately 150,000 speakers remaining, mostly elderly. When this generation passes, the language dies with them.

Demo

Early prototype — Sophia Elya speaking in transatlantic accent (SadTalker + F5-TTS)

Voice Presets

Each preset trains on a specific era and accent. Sophia Elya appears in period-accurate imagery for each voice.

Sophia as Edison era
Edison
1890s · Wax cylinder
Sophia as Cajun French speaker
Cajun French
1880s · Attakapas prairie
Sophia in 1920s Cajun setting
Cajun French
1920s · Cloche hat
Sophia as 1930s schoolteacher
Cajun English
1930s · Schoolteacher
Sophia fireside
Fireside
1940s · By the hearth
Sophia transatlantic
Transatlantic
1940s · Tweed blazer
Sophia newsreel reporter
Newsreel
1940s · EXTRA!
Sophia at RCA mic
Radio Drama
1940s · RCA mic
Sophia wartime
Wartime
1940s · Military blazer
Sophia announcer
Announcer
1950s · ON AIR

Architecture

F5-TTS separates voice identity (from reference audio) from speech style (from training data). Fine-tuning teaches vintage delivery. At inference, provide any voice as reference and the model generates speech with that voice but vintage delivery.

TRAINING PIPELINE Archive.org Public Domain Audio (1888-1955) ├── Old Time Radio (The Shadow, Suspense) ├── FDR Fireside Chats ├── Newsreels (Movietone, Pathe) ├── Edison Cylinder Recordings └── Cajun French (hymns, Opelousas Sostan, family) │ ▼ Whisper Large V3 Turbo (transcription) │ ▼ 44,345 text-audio aligned segments (164 hours) │ ▼ F5-TTS Fine-Tuning (337M params) ├── V100 32GB: Transatlantic preset └── RTX 5070: Edison preset INFERENCE PIPELINE Vintage Reference Audio ──┐ (Hepburn, FDR, newsreel) │ ▼ F5-TTS Model ──► Generated Speech ▲ (vintage accent + voice) Text Prompt ──────────────┘ "Good evening, darling..."

Cajun French Preservation

This project is personal. Scott Boudreaux's family traces directly to the Acadian Expulsion of 1755–1764. His 6th great-grandfather, Augustine Dit Remi Boudreaux, arrived in the Attakapas region at age 16, separated from his family. 260 years later, his grandmother — 93, with dementia — is one of the last native Cajun French speakers.

Cajun French is classified by UNESCO as “severely endangered.” Louisiana Creole is “critically endangered.” VintageVoice is collecting family recordings and public domain Cajun French audio to build the first TTS model for this dying language.

What we're collecting

Family Recording Guide — instructions for capturing elderly speakers before their voices are lost.

How It Was Built

ComponentCostDetails
Storage$6918TB Seagate Expansion (Amazon refurb)
GPUs$02x Tesla V100 32GB + RTX 5070 (already in lab)
Training data$0Public domain from Archive.org
Base model$0F5-TTS open source (337M params)
Electricity~$69Estimated training run
Total~$138

Current Status

Training data (164 hours, 44,345 segments)
Complete
V100 Transatlantic training
Epoch 11/50
RTX 5070 Edison training
Epoch 15/50
Cajun French data collection
In Progress
Voice conversion (Sophia timbre)
Research
HuggingFace model release
After Training

Commercial Applications

Film & TV — Period productions
Video Games — Historical NPCs
Audiobooks — Vintage narration
Documentaries — Period narrators
Museums — Historical figures speaking
Language Preservation — Endangered dialects