Solena

  • 2 Devlogs
  • 4 Total hours

An AI model to research the efficiency of tokenizer methods

2h 0m 8s logged

Day#2

Completed the generation scripts and scaled up the training, woohoo!
Temporarily using Kaggle for their TPUs, so shoutout to them and their support team, who solved my issue very fast. The loss curves look pretty good. They have plateaued a bit, but it happens; we fix and continue!

1h 49m 1s logged

Day#1

Built the decode/encode pipeline and prepared the training dataset. Training scripts are ready. After creating the generation and inference pipelines, we will continue with training. Shoutout to TPU Research Cloud for providing access to compute and support!
