Evaluating bids

Italian Native Speaker Voice Data Collection for Ai Training

Published on the January 03, 2026 in Design & Multimedia

About this project

Open

We are urgently seeking native Italian speakers (age 18-55, clear & standard pronunciation, no strong regional accent) to record high-quality voice data for wake-word and offline command training. Each participant will record a total of 98 sentences. Wake word "Hey Nawa" must be read naturally connected (no pause between "Hey" and "Nawa"). Wake word part: read 2 times per person (1x normal speed, 1x slightly faster). Offline command part: each command sentence is read 2 times (1x normal speed, 1x slightly faster), in the format: "Hey Nawa" + >=2 seconds silence + command sentence. Total ~98 utterances per person (wake-word + command combinations). Recording requirements: Device: smartphone (built-in microphone) Distance: ~fist distance from mouth Environment: very quiet room, no audible background noise Minimum SNR: >=15 dB Format: 16 kHz, 16-bit, mono WAV Each file must have >=1 second silence at beginning and end No clipping, no breathing noise, no mispronunciation, clear articulation File naming & folder structure (very important): Folder name example: IT_Milan_Female_001 or IT_Rome_Male_007 File name examples: Hey Nawa normal.wav Hey Nawa faster.wav Hey Nawa Inizia la pulizia normal.wav Hey Nawa Inizia la pulizia faster.wav ... (Exact text + speed label) Please provide separate folders or clearly named groups for wake-word training data and offline command training data. Include matching text transcriptions. We will provide the full sentence list + reference audio for "Nawa" pronunciation after selection.

Category Design & Multimedia
Subcategory Other
Project size Small

Delivery term: Not specified

Skills needed