System Design for Project Sophia

Sunday, September 22nd 2024

One of the issues I've come up against while building the Sophia app has been getting stuck in specific parts.

In the past, it was easy because it was small and I would keep on moving.

One of the biggest problems was the dependence on 11labs, and the poor quality.

I will need to address this in a timely fashion when I get a chance to audit every audio file.

This might be something I do a little bit of every day, that's just the only way to make progress.

But today I want to lay out the overall system in a cleaner way and make sure that I can break things down so that trouble shooting one point doesn't regress the entire system.

This will ideally be in the form of a visual diagram, but first I'm going to jot down the different components:

  1. Surface YouTube Lists
  2. Enable the Selection of a Video
  3. Add the Video to the Compaction pipeline

The Compaction Pipeline

  1. Create a record, a Compaction
  2. Extract the Audio as an Audio File
  3. Create the Audio File as a Text Transcript
  4. Take the Transcript through an LLM to get a summary
  5. Take the summary through Voice Generator to get Audio
  6. Make the Audio available via the Firebase database
  7. Push the Audio out onto a Podcast
  8. Push the Audio out onto YouTube

What tool will I use for the visual?

Whimsical

The Visual Workspace | Whimsical