Week3

Week 3 MVP Development #

Project name: VoiceDiary

Code repository: link

VoiceDiary is an AI-powered voice journaling tool that analyzes tone, emotion, and key themes in spoken entries. It generates personalized emotional insights and well-being suggestions based on recorded reflections.

Team #


Team memberTelegram aliasInnopolis EmailResponcibilities
Dziyana Melnikava@meldilen24dz.melnikava@innopolis.universityPM, Frontend
Anastasia Kuchumova@n_rngka.kuchumova@innopolis.universityFrontend, UX/UI
Dzhamilia Fatkullina@jam11ad.fatkullina@innopolis.universityML
Elina Kuzmichyova@lin_anilee.kuzmichyova@innopolis.universityML
Olesia Novoselova@doiwannaknoww8o.novoselova@innopolis.universityBackend
Danil Davydyan@chocopd.davydyan@innopolis.universityBackend

Implemented MVP features #

  • Emotion recognition based on voice
  • Summary for user voices
  • User frendly design: minmum effort functionality
  • Pause and resume recording without data loss
  • Visual feedback while recording: timer and waveform

Demonstration of the working MVP #

CLICK TO SEE

Frontend #

Responsible: Dziyana Melnikava, Anastasia Kuchumova

Link to Frontend contribution: CLICK TO SEE


Backend #

Responsible: Olesia Novoselova, Danil Davydyan

Link to Backend contribution: CLICK TO SEE

Swagger demonstartion: CLICK TO SEE


ML #

Responsible: Dzhamilia Fatkullina, Elina Kuzmichyova

Link to ML contribution: CLICK TO SEE

Link to the training code: CLICK TO SEE

Link to model artifacts: CLICK TO SEE

Individual commitments #


Team memberTelegram aliasContributionLink
Dziyana Melnikava (Lead)@meldilen24Connected frontend with backend, Finished adaptive design of Sign-Up and Loging page, Made custom Error handling, updated READMECommits, README
Anastasia Kuchumova@n_rngkMade results page, Improved UX-UI by adding above microphone clues and a loader, Updated DockerfileCommits
Dzhamilia Fatkullina@jam11aAdd new emotion recognition model, added multilingual support and translation, write reportCommits, README
Elina Kuzmichyova@lin_anileTrain Wav2Vec model, add model for text summary, write reportCommits, Models, README
Olesia Novoselova@doiwannaknoww8Implemented the logic for voice data transfer, improved integration between the Go API and the Python ML service, set up Docker ComposeCommits, Swagger Demo
Danil Davydyan@chocopImproved communication between the backend and ML, established communication between the Go API and Python APICommits

Plan for Next Week #

ML #

  • Research emotion recognition model based on text
  • Research LLM model to add support comments based on user voices

Frontend #

  • Finish Authorization page and connect it with backend
  • Create Hompage
  • Limit record length for unauthorized users

Backend #

  • Service logs
  • Secure storage of user data
  • Performance measurements under load
  • Authorization

Confirmation of the code’s operability #

We confirm that the code in the main branch:

  • [✓] In working condition.
  • [✓] Run via docker-compose (or another alternative described in the README.md).