AI创想
标题:
LangChain支持哔哩哔哩视频总结
[打印本页]
作者:
dzmzyqy
时间:
昨天 22:39
标题:
LangChain支持哔哩哔哩视频总结
作者:毛毛的毛毛
是基于LangChain框架下的开发,所以最开始请先
pip install Langchain
pip install bilibili-api-python
复制代码
技术要点:
使用Langchain框架自带的Document loaders
修改BiliBiliLoader的源码,自带的并不支持当前b站的视频加载
源码文件修改:
import json
import re
import warnings
from typing import List, Tuple
import requests
from langchain_core.documents import Document
from bilibili_api import sync, video
from langchain_community.document_loaders.base import BaseLoader
# Pre-compile regular expressions for video ID extraction
BV_PATTERN = re.compile(r"BV\w+")
AV_PATTERN = re.compile(r"av[0-9]+")
class BiliBiliLoader(BaseLoader):
"""
Loader for fetching transcripts from BiliBili videos.
"""
def __init__(self, video_urls: List[str], sessdata: str, bili_jct: str, buvid3: str):
"""Initialize with bilibili url.
Args:
video_urls (List[str]): List of BiliBili v
复制代码
原文地址:https://blog.csdn.net/weixin_41227420/article/details/136238039
欢迎光临 AI创想 (https://www.llms-ai.com/)
Powered by Discuz! X3.4