Commit Graph

15 Commits

Author SHA1 Message Date
b69ba150c5 Remove debug 2024-03-29 12:28:26 +08:00
3a352a9705 Dont need to chunk this anymore 2024-03-29 12:05:32 +08:00
08ed061756 Change model for youtube transcripts 2024-03-29 11:31:27 +08:00
1afec1ddf3 check if feature is granted when process youtube video 2024-03-28 14:41:30 +08:00
38f959685f Change warn to warning 2024-03-27 15:41:15 +08:00
28222dcd56 Add more logging to youtube jobs 2024-03-27 11:25:46 +08:00
014ccb1eae Set published date from youtube data
its not clear how consistent the formatting is on these dates,
the one i tested seems like it was a UTC formatted dd M, yyyy string
2024-03-20 13:10:41 +08:00
596ab5a7aa Feature flag the transcripts 2024-03-15 20:28:02 +08:00
f2d23626a5 Improve error logging / fix linting 2024-03-14 17:39:13 +08:00
dc6c047aec Add GCS cache 2024-03-14 16:12:06 +08:00
629c044273 Queue the transcript processing as a separate job
Handle YouTube in two steps, first get metadata then
get the transcript.
2024-03-14 15:09:53 +08:00
fef28d1c6a Linting fix 2024-03-14 12:48:56 +08:00
3ee6787e39 Improve transcript generation 2024-03-14 12:48:56 +08:00
39dfa920b5 More fixes to youtube processor 2024-03-14 12:48:56 +08:00
7c3d15e31a Add some scrolling on youtube videos 2024-03-14 12:48:56 +08:00