I added a feature where the user will upload a file that will be saved in /upload and then call an OpenAI’s whisper transcription API to do the task of speech-to-text. Since the app only shows text in mere html, I’ll have to improve its look later on. It’s also important that I have to figure out the coaching logic to evaluate the user’s speech (pace, tone, filler words, audience, or what purpose?). Next step is to focus on AI’s recommendation for users’ speech improvement.
Comments 0
No comments yet. Be the first!
Sign in to join the conversation.