wujunwei928 / parse-video-py

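Python crawler for removing watermarks from short videos: Douyin, Pipixia, Huoshan, Weishi, Zuiyou, Kuaishou, Quanmin Xiaoshipin, Pipi Gaoxiao, Xigua Video, Huya, Pear Video, AcFun, Haokan Video...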

License: MIT License

Python 89.21% HTML 9.71% Dockerfile 1.08%
python spider video

parse-video-py's Introduction

Hi there 👋

I am looking for work in Beijing, please contact me if you are interested: [email protected]

parse-video-py's People

Contributors

wujunwei928

parse-video-py's Issues

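Followed the deployment steps in readme.md, but the front-end page cannot be reached

Sorry to bother you. This is my first time building a Docker project.

Step 1: I pulled the project from GitHub to my machine and opened it in VS Code.

Step 2: following readme.md, I
installed the dependencies with pip install -r requirements.txt,
then installed Docker,
ran docker pull wujunwei928/parse-video-py,
and then ran docker run -d -p 8000:8000 wujunwei928/parse-video.
At that point I opened the front-end page at http://127.0.0.1:8000/, but it would not load.

What could be causing this?

I also tried the same steps in BT Panel (宝塔面板) and then visited my-public-IP:8000, but that did not work either.

Did I do something wrong?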

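Douyin parsing appears to have stopped working

Yesterday or the day before, I noticed that Douyin links can no longer be parsed.

I tried to fix it myself, but after a long time I still could not get it working, so I am asking for help.

The test link I used is: 0.23 复制打开抖音,看看【陈翔六点半的作品】老板让我抓重点,我给老板长长脸 # 陈翔六点半 https://v.douyin.com/i6YP3jkV/ Mjp:/ [email protected] 11/04

The relevant part of the fetched page should be the script below, but I cannot work out how to change the code to extract it. I am not confident enough to attempt a major rewrite yet, so any pointers would be appreciated.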

<script nonce="Yap0wSwhyb3dCplGmLQX0" >window._ROUTER_DATA = {"loaderData":{"video_layout":null,"video_(id)\u002Fpage":{"ua":"Mozilla\u002F5.0 (iPad; CPU OS 17_0_3 like Mac OS X) AppleWebKit\u002F605.1.15 (KHTML, like Gecko) Version\u002F17.0 Mobile\u002F15E148 Safari\u002F605.1.15","isSpider":false,"webId":"7385026683889944091","query":{"region":"CN","mid":"7383578121494252297","u_code":"5c50bclh8el","did":"MS4wLjABAAAAi3thxqHIn85sce3UERmYVVQUc_AfvBdrc_yC9jwRWvSRWePlgL9QNwqa1fpcD91L","iid":"MS4wLjABAAAA1UUm99t5akeRe2nnP6LhdVyRTNs-0KjlxtOc6gWDdMbQD7n0iBDhmwTDxYV9OP28","with_sec_did":"1","titleType":"title","share_sign":"Q.jbGGNdrBhit3YHbDL.NVD.HrrYUTkYESuRQe5h0k8-","share_version":"300100","ts":"1719213263","from_aid":"1128","from_ssr":"1","utm_source":"copy","utm_campaign":"client_share","utm_medium":"android","app":"aweme"},"renderInSSR":1,"lastPath":"7383578025868283162","appName":"safari","host":"www.iesdouyin.com","isNotSupportWebp":false,"commonContext":{"ua":"Mozilla\u002F5.0 (iPad; CPU OS 17_0_3 like Mac OS X) AppleWebKit\u002F605.1.15 (KHTML, like Gecko) Version\u002F17.0 Mobile\u002F15E148 Safari\u002F605.1.15","isSpider":false,"webId":"7385026683889944091","query":{"region":"CN","mid":"7383578121494252297","u_code":"5c50bclh8el","did":"MS4wLjABAAAAi3thxqHIn85sce3UERmYVVQUc_AfvBdrc_yC9jwRWvSRWePlgL9QNwqa1fpcD91L","iid":"MS4wLjABAAAA1UUm99t5akeRe2nnP6LhdVyRTNs-0KjlxtOc6gWDdMbQD7n0iBDhmwTDxYV9OP28","with_sec_did":"1","titleType":"title","share_sign":"Q.jbGGNdrBhit3YHbDL.NVD.HrrYUTkYESuRQe5h0k8-","share_version":"300100","ts":"1719213263","from_aid":"1128","from_ssr":"1","utm_source":"copy","utm_campaign":"client_share","utm_medium":"android","app":"aweme"},"renderInSSR":1,"lastPath":"7383578025868283162","appName":"safari","host":"www.iesdouyin.com","isNotSupportWebp":false},"videoInfoRes":{"item_list":[{"aweme_id":"7383578025868283162","desc":"老板让我抓重点,我给老板长长脸 
#陈翔六点半","create_time":1719138600,"author":{"short_id":"3559807","nickname":"陈翔六点半","signature":"陈翔导演作品\n小号@六点半日记","avatar_thumb":{"uri":"100x100\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65","url_list":["https:\u002F\u002Fp3.douyinpic.com\u002Faweme\u002F100x100\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp9.douyinpic.com\u002Faweme\u002F100x100\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp6.douyinpic.com\u002Faweme\u002F100x100\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062"]},"avatar_medium":{"uri":"100x100\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65","url_list":["https:\u002F\u002Fp3.douyinpic.com\u002Faweme\u002F100x100\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp9.douyinpic.com\u002Faweme\u002F100x100\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp6.douyinpic.com\u002Faweme\u002F100x100\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062"]},"follow_status":0,"following_count":0,"favoriting_count":0,"unique_id":"cxldb001","mplatform_followers_count":0,"followers_detail":null,"platform_sync_info":null,"geofencing":null,"policy_version":null,"sec_uid":"MS4wLjABAAAA4N4OrZzTSmCPp8vVAqCeyU215Kav2JgFv2Lfy4DNWRs","type_label":null,"card_entries":null,"mix_info":null},"music":{"mid":"7383578121494252297","title":"@陈翔六点半创作的原声一陈翔六点半(原声中的歌曲:Allo Zoo-Fabio Borgazzi & Maria Grazia 
Borgazzi)","author":"陈翔六点半","cover_hd":{"uri":"1080x1080\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65","url_list":["https:\u002F\u002Fp3.douyinpic.com\u002Faweme\u002F1080x1080\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp9.douyinpic.com\u002Faweme\u002F1080x1080\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp6.douyinpic.com\u002Faweme\u002F1080x1080\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062"]},"cover_large":{"uri":"1080x1080\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65","url_list":["https:\u002F\u002Fp3.douyinpic.com\u002Faweme\u002F1080x1080\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp9.douyinpic.com\u002Faweme\u002F1080x1080\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp6.douyinpic.com\u002Faweme\u002F1080x1080\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062"]},"cover_medium":{"uri":"720x720\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65","url_list":["https:\u002F\u002Fp3.douyinpic.com\u002Faweme\u002F720x720\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp9.douyinpic.com\u002Faweme\u002F720x720\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062","https:\u002F\u002Fp6.douyinpic.com\u002Faweme\u002F720x720\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65.jpeg?from=327834062"]},"cover_thumb":{"uri":"168x168\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65","url_list":["https:\u002F\u002Fp3.douyinpic.com\u002Fimg\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65~c5_168x168.jpeg?from=327834062","https:\u002F\u002Fp6.douyinpic.com\u002Fimg\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65~c5_168x168.jpeg?from=327834062","https:\u002F\u002Fp9.douyinpic.com\u002Fi
mg\u002Faweme-avatar\u002Fmosaic-legacy_c150001e6de3d8e4e65~c5_168x168.jpeg?from=327834062"]},"duration":207,"position":null,"status":1},"cha_list":null,"video":{"play_addr":{"uri":"v0200fg10000cprron7og65kgrugr820","url_list":["https:\u002F\u002Faweme.snssdk.com\u002Faweme\u002Fv1\u002Fplaywm\u002F?video_id=v0200fg10000cprron7og65kgrugr820&ratio=720p&line=0"]},"cover":{"uri":"tos-cn-i-dy\u002Fbbd9809b607043ef9d6937afa0b717d8","url_list":["https:\u002F\u002Fp26-sign.douyinpic.com\u002Ftos-cn-i-dy\u002Fbbd9809b607043ef9d6937afa0b717d8~c5_300x400.webp?x-expires=1720666800&x-signature=Oq8FYMIBMxEPXTrWk5Kv2tJJHqg%3D&from=327834062_large&s=PackSourceEnum_DOUYIN_REFLOW&se=false&sc=cover&biz_tag=aweme_video&l=20240627115533ADF8EE314DD98C03BB8F","https:\u002F\u002Fp3-sign.douyinpic.com\u002Ftos-cn-i-dy\u002Fbbd9809b607043ef9d6937afa0b717d8~c5_300x400.webp?x-expires=1720666800&x-signature=DSIR2J6butEYiksXBLEyMLg9Vys%3D&from=327834062_large&s=PackSourceEnum_DOUYIN_REFLOW&se=false&sc=cover&biz_tag=aweme_video&l=20240627115533ADF8EE314DD98C03BB8F","https:\u002F\u002Fp11-sign.douyinpic.com\u002Ftos-cn-i-dy\u002Fbbd9809b607043ef9d6937afa0b717d8~c5_300x400.webp?x-expires=1720666800&x-signature=1EJ6WkAGamwjc1X%2FzE2lo8Zxb64%3D&from=327834062_large&s=PackSourceEnum_DOUYIN_REFLOW&se=false&sc=cover&biz_tag=aweme_video&l=20240627115533ADF8EE314DD98C03BB8F","https:\u002F\u002Fp26-sign.douyinpic.com\u002Ftos-cn-i-dy\u002Fbbd9809b607043ef9d6937afa0b717d8~c5_300x400.jpeg?x-expires=1720666800&x-signature=figW4xZ9Uedsp4fBRviFv1p%2B1BQ%3D&from=327834062_large&s=PackSourceEnum_DOUYIN_REFLOW&se=false&sc=cover&biz_tag=aweme_video&l=20240627115533ADF8EE314DD98C03BB8F"]},"height":1080,"width":1920,"bit_rate":null},"statistics":{"aweme_id":"7383578025868283162","comment_count":8860,"digg_count":290006,"play_count":0,"share_count":21763,"collect_count":13577},"text_extra":[{"start":16,"end":22,"type":1,"hashtag_name":"陈翔六点半","hashtag_id":1585033776113694}],"video_labels":null,"aweme_type":4,"image_i
nfos":null,"risk_infos":{"warn":false,"type":0,"content":"","reflow_unplayable":0},"comment_list":null,"geofencing":null,"video_text":null,"label_top_text":null,"promotions":null,"long_video":null,"images":null,"group_id_str":"7383578025868283162","chapter_list":null,"interaction_stickers":null,"img_bitrate":null}],"filter_list":[],"status_code":0,"extra":{"now":1719460533229,"logid":"20240627115533ADF8EE314DD98C03BB8F"}},"itemId":"7383578025868283162","isVideoOptimize":true,"isAutoOpenApp":true,"darkModeAdaptation":true,"serverToken":""}},"errors":null}</script>
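
For reference, here is a minimal sketch of how the embedded data could be read, assuming the page keeps the structure shown in the sample above. The function names `extract_video_item` and `summarize` are hypothetical and are not part of parse-video-py; only the JSON keys come from the sample.

```python
import json
import re

# Matches the inline script that assigns window._ROUTER_DATA.
ROUTER_DATA_RE = re.compile(
    r"window\._ROUTER_DATA\s*=\s*(\{.*?\})\s*</script>", re.DOTALL
)


def extract_video_item(html: str) -> dict:
    """Return the first entry of item_list from the embedded _ROUTER_DATA JSON."""
    match = ROUTER_DATA_RE.search(html)
    if match is None:
        raise ValueError("window._ROUTER_DATA not found in page")
    data = json.loads(match.group(1))
    # json.loads decodes "\u002F" to "/", so the key becomes "video_(id)/page".
    page = data["loaderData"]["video_(id)/page"]
    items = page["videoInfoRes"]["item_list"]
    if not items:
        raise ValueError("item_list is empty")
    return items[0]


def summarize(item: dict) -> dict:
    """Pick out a few fields that are visible in the sample above."""
    return {
        "desc": item["desc"],
        "author": item["author"]["nickname"],
        "video_url": item["video"]["play_addr"]["url_list"][0],
        "cover_url": item["video"]["cover"]["url_list"][0],
    }
```

The sketch only reads what the sample exposes. Whether the returned play_addr URL is the watermark-free one is not something the sample confirms.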

Error when running on Python 3.12

sses.py", line 1256, in wrap
return _process_class(cls, init, repr, eq, order, unsafe_hash,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Cellar/[email protected]/3.12.2_1/Frameworks/Python.framework/Versions/3.12/lib/python3.12/dataclasses.py", line 994, in _process_class
cls_fields.append(_get_field(cls, name, type, kw_only))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Cellar/[email protected]/3.12.2_1/Frameworks/Python.framework/Versions/3.12/lib/python3.12/dataclasses.py", line 852, in _get_field
raise ValueError(f'mutable default {type(f.default)} for field '
ValueError: mutable default <class 'parser.base.VideoAuthor'> for field author is not allowed: use default_factory

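Xigua Video links cannot be parsed

I tried a link shared from Xigua Video and parsing failed. Is it the case that only videos in the 精选 (Featured) tab can be parsed?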

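Kuaishou image albums: wrong image count

Kuaishou image albums are mostly recognized correctly.
However, for one type of album that scrolls vertically, the parser returns more images than the album contains.
The album actually has 13 images. The test album link is: https://v.kuaishou.com/EYhNCZ 超级好看的全屏壁纸,把亮度调到最高 "超好看的背景图 "请礼貌拿图谢谢 该作品在快手被播放过24.8万次,点击链接,打开【快手】直接观看!

My own test deployment: https://daohang.bxmang.top/vp/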

The Postman test result is as follows: https://daohang.bxmang.top/vp/video/share/url/parse?url=encodeURIComponent(https://v.kuaishou.com/EYhNCZ )
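
The request above can also be built outside the browser. The following is a rough Python sketch, assuming the endpoint path shown in the Postman URL; `build_parse_url` is a hypothetical helper, and `quote(..., safe="")` is used as an approximate stand-in for JavaScript's `encodeURIComponent`.

```python
from urllib.parse import quote


def build_parse_url(base: str, share_url: str) -> str:
    """Append the percent-encoded share link to the parse endpoint."""
    # safe="" makes quote() escape ":" and "/" as well, so the share link
    # survives intact as a single query-parameter value.
    encoded = quote(share_url, safe="")
    return f"{base.rstrip('/')}/video/share/url/parse?url={encoded}"


print(build_parse_url("https://daohang.bxmang.top/vp/", "https://v.kuaishou.com/EYhNCZ"))
# https://daohang.bxmang.top/vp/video/share/url/parse?url=https%3A%2F%2Fv.kuaishou.com%2FEYhNCZ
```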

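When I logged in to GitHub I was very happy to see that Kuaishou image album support had been added. Because my domain already hosts other projects, I changed the request URL and added a vp suffix when deploying. I am a beginner and rely on Kimi for most of my code, so my Nginx configuration is the one shown in this screenshot:

(screenshot: Nginx configuration)

In practice I found that the requests in index.html also need a small change. If there is a better way to do this, please let me know; I am mostly guessing and patching whatever breaks.

(screenshot: modified index.html)

After deployment the site is at https://daohang.bxmang.top/vp/

The second problem I found in testing: the parsed link is returned correctly, but downloading the video always fails with 403 Forbidden. What causes this, and is there a fix? Following the on-page hint (copy the link and open it in a browser) does work around the problem.

Reading the code, I found the structure very solid and elegant. Great work.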

Error when running on Python 3.12: ValueError: mutable default <class 'parser.base.VideoAuthor'> for field author is not allowed: use default_factory

@dataclasses.dataclass
^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Cellar/[email protected]/3.12.2_1/Frameworks/Python.framework/Versions/3.12/lib/python3.12/dataclasses.py", line 1266, in dataclass
return wrap(cls)
^^^^^^^^^
File "/opt/homebrew/Cellar/[email protected]/3.12.2_1/Frameworks/Python.framework/Versions/3.12/lib/python3.12/dataclasses.py", line 1256, in wrap
return _process_class(cls, init, repr, eq, order, unsafe_hash,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Cellar/[email protected]/3.12.2_1/Frameworks/Python.framework/Versions/3.12/lib/python3.12/dataclasses.py", line 994, in _process_class
cls_fields.append(_get_field(cls, name, type, kw_only))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Cellar/[email protected]/3.12.2_1/Frameworks/Python.framework/Versions/3.12/lib/python3.12/dataclasses.py", line 852, in _get_field
raise ValueError(f'mutable default {type(f.default)} for field '
ValueError: mutable default <class 'parser.base.VideoAuthor'> for field author is not allowed: use default_factory
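
The error message itself points at the usual fix. Since Python 3.11, dataclasses reject any unhashable default value, and an instance of a plain `@dataclass` class is unhashable, so a default like `author: VideoAuthor = VideoAuthor()` is refused. Below is a minimal sketch of the `default_factory` form. Only the names `VideoAuthor` and `author` come from the traceback; `VideoInfo` and all other fields are hypothetical stand-ins for whatever parser/base.py defines.

```python
from dataclasses import dataclass, field


@dataclass
class VideoAuthor:
    # Hypothetical fields; the real class lives in parser/base.py.
    uid: str = ""
    name: str = ""
    avatar: str = ""


@dataclass
class VideoInfo:
    # Hypothetical container; only the `author` field name is from the traceback.
    video_url: str = ""
    title: str = ""
    # `author: VideoAuthor = VideoAuthor()` raises the ValueError on Python 3.11+.
    # default_factory builds a fresh VideoAuthor for every VideoInfo instead.
    author: VideoAuthor = field(default_factory=VideoAuthor)
```

With this form each `VideoInfo()` gets its own `VideoAuthor`, so changing one instance's author does not affect another.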
