Splitting up long chunks of automatically transcribed text

Поделиться
HTML-код
  • Опубликовано: 28 сен 2024

Комментарии • 12

  • @rygy409
    @rygy409 4 месяца назад +11

    A QoL feature might be highlighting the words on the right panel if they are included in the current clips on the timeline.

  • @jomojomo123
    @jomojomo123 4 месяца назад

    "is it a bug or is it just a little bit of a surprise" 😂😂

  • @CrossbowBeta
    @CrossbowBeta 4 месяца назад +1

    Next step: Editing the text and the video gets updated.

  • @distortions
    @distortions 4 месяца назад +4

    whats up whats up whats up

  • @bart2019
    @bart2019 4 месяца назад

    In your place, i'd try to use a text editing widget for the script text and use it in read-only mode. That may detecting which word a user clicked on easier because it's likely built in.
    Also, you need a way to search in the text, again, just like in a text editor.

  • @gastonmaffei
    @gastonmaffei 4 месяца назад +1

    one thing I'd add is an LLM powered search bar to search for nuggets in natural language. So something like "show me where I talk about X", and it jumps to that timestamp

  • @Dje4321
    @Dje4321 4 месяца назад

    Should highlight the current position in the text window to help with debugging to you know exactly what words are at which specific parts in the timelines

  • @CrossbowBeta
    @CrossbowBeta 4 месяца назад

    So much fun statistics can be collected with this too. How many words spoken during a stream. Word occurrence histogram. Average word complexity. Then group the words together to identify a set of tags for the video. This is so cool.

  • @flobuilds
    @flobuilds 4 месяца назад

    I miss the time where we played trackmania on the school computers. It was fun

  • @gvteja4908
    @gvteja4908 4 месяца назад

    Along with the text highlighting which others have mentioned, maybe change the audio based on text edits in the transcript, although this might be a very intense thing to do

  • @RoryDavidWatts
    @RoryDavidWatts 4 месяца назад

    This is looking great

  • @deprecated_channel_
    @deprecated_channel_ 4 месяца назад +1

    Hi,
    I really like your project and seeing you go through the dev process.
    I think it would be easier to track if the current spoken word would be in bold, or a different color. A bit like karaoke.
    And from what I understand, Whisper already gives you the ranges of each word.
    While playing the video, iterating through the list would be fast.
    And on seek, since the word & time range list is in a sorted order, doing a binary search should be pretty fast.