Week 3 MVP Development #

Project name: VoiceDiary

Code repository: link

VoiceDiary is an AI-powered voice journaling tool that analyzes tone, emotion, and key themes in spoken entries. It generates personalized emotional insights and well-being suggestions based on recorded reflections.

Team #

Team member	Telegram alias	Innopolis Email	Responcibilities
Dziyana Melnikava	@meldilen24	dz.melnikava@innopolis.university	PM, Frontend
Anastasia Kuchumova	@n_rngk	a.kuchumova@innopolis.university	Frontend, UX/UI
Dzhamilia Fatkullina	@jam11a	d.fatkullina@innopolis.university	ML
Elina Kuzmichyova	@lin_anile	e.kuzmichyova@innopolis.university	ML
Olesia Novoselova	@doiwannaknoww8	o.novoselova@innopolis.university	Backend
Danil Davydyan	@chocop	d.davydyan@innopolis.university	Backend

Implemented MVP features #

Emotion recognition based on voice
Summary for user voices
User frendly design: minmum effort functionality
Pause and resume recording without data loss
Visual feedback while recording: timer and waveform

Demonstration of the working MVP #

CLICK TO SEE

Frontend #

Responsible: Dziyana Melnikava, Anastasia Kuchumova

Link to Frontend contribution: CLICK TO SEE

Backend #

Responsible: Olesia Novoselova, Danil Davydyan

Link to Backend contribution: CLICK TO SEE

Swagger demonstartion: CLICK TO SEE

ML #

Responsible: Dzhamilia Fatkullina, Elina Kuzmichyova

Link to ML contribution: CLICK TO SEE

Link to the training code: CLICK TO SEE

Link to model artifacts: CLICK TO SEE

Individual commitments #

Team member	Telegram alias	Contribution	Link
Dziyana Melnikava (Lead)	@meldilen24	Connected frontend with backend, Finished adaptive design of Sign-Up and Loging page, Made custom Error handling, updated README	Commits, README
Anastasia Kuchumova	@n_rngk	Made results page, Improved UX-UI by adding above microphone clues and a loader, Updated Dockerfile	Commits
Dzhamilia Fatkullina	@jam11a	Add new emotion recognition model, added multilingual support and translation, write report	Commits, README
Elina Kuzmichyova	@lin_anile	Train Wav2Vec model, add model for text summary, write report	Commits, Models, README
Olesia Novoselova	@doiwannaknoww8	Implemented the logic for voice data transfer, improved integration between the Go API and the Python ML service, set up Docker Compose	Commits, Swagger Demo
Danil Davydyan	@chocop	Improved communication between the backend and ML, established communication between the Go API and Python API	Commits

Plan for Next Week #

ML #

Research emotion recognition model based on text
Research LLM model to add support comments based on user voices

Frontend #

Finish Authorization page and connect it with backend
Create Hompage
Limit record length for unauthorized users

Backend #

Service logs
Secure storage of user data
Performance measurements under load
Authorization

Confirmation of the code’s operability #

We confirm that the code in the main branch:

[✓] In working condition.
[✓] Run via docker-compose (or another alternative described in the README.md).