MonkeySpeak
- 6 Devlogs
- 6 Total hours
track how fast you can speak .. as that is going to be the new way of communicating with tech
track how fast you can speak .. as that is going to be the new way of communicating with tech
095b731 and earlier speech routing work on main)the ui now shows errors for the provider that actually failed, not a generic “mic blocked” when stt died for other reasons.
var name ur wish websocket proxy over the vercel http bridgeutterance_end_ms locked to 1000 everywhere (deepgram rejects live ws with 400 below that)BridgeReady, not on first random chunkbackend/ express + ws proxy deployable on render (render.yaml included)
set up env with ur var name at vercel
redeployed frontend via vercel cli after env update
brave and edge block or mishandle the vercel http audio bridge (duplex fetch upload), so deepgram connections hung ~25s and never returned transcripts.
those browsers also cannot open an authenticated websocket straight to api.deepgram.com, so they need the render ws proxy (wss://…/api/deepgram/proxy) instead of the chrome-friendly paths.
even after the proxy worked, deepgram sent json in binary ws frames and the client ignored anything that was not a string, so words never updated until we parsed blob/arraybuffer payloads.
sorry if i yapped a lot and the fixes where actually slighly more techy stuff so i didnt wanna yap abt that as wel so yeah if u want to know lmk in replies :)
reworked the ui and got monkeyspeak ready for v0 ship launch :))
it’s basically monkeytype but for your voice — read a prompt out loud, get scored on speed + clarity.
features:
built with next.js, typescript, gsap, zustand.
repo: github.com/nothariharan/monkeyspeak
made the ui more clean still settling the latency here and there
current new approach i am building around is using the deepgram api key for initial purpose later trying to replicate what they do
open to questions or recommendations for the whole stt etc.
cooking up 🚀🔥
soon making it public just lots of latency issue in speaking to text conversion who knew it would be a pain but yeah let me know if u guys think of an alternative for it or anything at all
thanks for reading :)
going with a much better ui soon launching :) 🚀
for yall to try it
built the first iteration of the website !
got lots of customizing options now i am working more towards reducing the latency between what u speak and what is being transcribed
give me your opinions and recommendations :)