Skip to content

[Bug]: YouTube Transcript API get_transcript Fails #75

@Sandhya03242

Description

@Sandhya03242

Version

0.0.3

Operating System

Windows

Python Version

3.8

What happened?

Bug:

  • Using Synthetic-data-kit ingest with the latest youtube-transcript-api fails:
    Error: type object 'YouTubeTranscriptApi' has no attribute 'get_transcript'
  • Downgrading to older version(e.g. ,<1.0.0) restores .get_transcript(), but ingestion fail with:
    ❌ Error: no element found: line 1, column 0
  • However, when fetching transcripts manually using latest youtube-transcript-api works successfully

What did you expect to happen?

I except synthetic-data-kit ingest fetch YouTube transcripts successfully using the latest version of youtube-transcript-api without error

Solution:

  • Update the code to use the latest version of youtube-transcript-api
  • Replace .get_transcript() method with latest .fetch() method

Relevant log output

Steps to reproduce

  1. install the latest youtube-transcript-api
  2. Run synthetic-data-kit ingest with a Youtube video URL
  3. Observer the get_transcript attribute error

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions