Week #3

Week #3 #

Deployment #

Website with functionality: https://search.innohassle.ru
API deployment: https://api.innohassle.ru/search/staging-v0/docs

Implemented MVP features #

We have implemented the basic functionality of user search. At the moment, text search in mongo works fully, ml-search works but requires testing. Closed issues see on the board, project search.

Demonstration of the working MVP #

Image

Image

Image

Backend #

Anna wrote an endpoint for running parsers and saving their results in mongodb (see commit) and implemented basic text search in mongodb (will be used if the ml-model is unable to process the request, see commit)

Vladimir wrote an API for ml and wrote search logic (see commit)

Azaliia wrote a scheduler for periodically calling the parsing endpoint (see commit).

Anna also refactored the code and bugfixed so that the backend and ml worked in a common pipeline (see here or here or in other commits if needed).

ML #

Sofia connected the vector database (lancedb) and set up the ml pipeline (this commit).

Azaliia wrote a function that updates the contents of lancedb (called when parsers are launched, see).

Frontend #

Aliia completed the layout of components, connected the frontend with the backend (search, filtering; see commit)

ML artefacts #

At this stage of our work, our ml-part is implemented as a pipeline with a vector database, see the code here

In the future, we want to connect LLM for other functionality of our project (we assume that at the initial stage it will be one of the openai models and in the future a local model, for example ollama)

Internal demo #

The conducted testing revealed areas for improvement:

  • Add page anchors to links to lead the user to the desired section
  • Remove duplicate resources in ml search (the same link is issued several times because it appears in several chunks)
  • Improving the frontend (rethinking filtering options)

Weekly commitments #

Individual contribution of each participant #

Team MemberContribution
Anna Belyakova (Lead)See backend section
Vladimir PaskalSee backend section
Azaliia AlishevaSee backend and ML sections
Aliia BashirovaSee frontend section
Sofia PushkarevaSee ML section

Plan for Next Week #

Main: #

  • Test the search
  • Make the code readable, add logging
  • Synchronize the team’s understanding of the project code

Other, priority TBD: #

  • Handle moodle resources
  • Add resources for the current search pipeline (calling other APIs, parsing other resources)
  • Write API for the ask section functionality

Confirmation of the code’s operability #

!!! The working code in the backend repository is in the main branch, and in the frontend repository in the capstone branch (difficulties due to automatic deployment)

  • In working condition.
  • Run via docker-compose (or another alternative described in the README.md).