doiiarx / nlcisbnplugin Goto Github PK

View Code? Open in Web Editor NEW

420.0 4.0 9.0 41 KB

基于**国家图书馆ISBN检索的calibre的source/metadata插件

Home Page: https://notion.doiiars.com/

License: Apache License 2.0

Python 100.00%

calibre-plugin isbn metadata spider

nlcisbnplugin's Introduction

Hi there，这里是Doiiars 👋

欢迎联系我定制程序和爬虫。

擅长爬虫、图像处理（cv）、文档处理
有目标检测项目经验
有GUI设计经验
有网页设计经验
有ChatGPT项目经验
有Docker经验
有树莓派、单片机经验

👤游客访问

❤ 赞助 Donation

如果你觉得我的项目对你有帮助，请考虑赞助我，以激励我投入更多的时间进行维护与开发。

If you find my project helpful, please consider supporting my project going forward. Your support is greatly appreciated.

你的star或者赞助是我长期维护此项目的动力所在，由衷感谢每一位支持者，“每一次你花的钱都是在为你想要的世界投票”。另外，将本项目推荐给更多的人，也是一种支持的方式，用的人越多更新的动力越足。

nlcisbnplugin's People

Contributors

Stargazers

Watchers

Forkers

brainku nonomal rogerxu266 poorkids fossabot zzzhao2023 juanlynn bigmouse0001 shzhbook

nlcisbnplugin's Issues

请问将来能否支持**国家版本馆的数据呢？

https://pdc.capub.cn/
**国家版本馆的数据，我自己根据手头图书查找的结果，感觉比**国家图书馆要好的多。
所以冒昧问一下，将来能否支持**国家版本馆的数据呢？
谢谢!

元数据录入优化

希望开发者能优化一下元数据的录入内容：
1.简化书名信息，去掉原数据中的"文献类型"、“英文名称”、“作者”等。
2.优化作者字段，去掉“著”字，这会影响calibre书库目录名称。
3.优化标签字段，将出版社、出版年等信息移入calibre原出版商、出版日期字段。
4.优化标识符字段，nlchash数据似乎没什么用，希望移除。
5.另，优化书名模糊搜索，基本上没有isbn的情况下没有结果。

下载原数据显示下载失败

calibre, version 7.3.0
错误： 下载失败: 无法下载元数据。请点击“显示详情”获取更多信息。

Traceback (most recent call last):
  File "calibre\utils\ipc\simple_worker.py", line 303, in main
  File "calibre\ebooks\metadata\sources\worker.py", line 58, in wrapper
  File "calibre\ebooks\metadata\sources\worker.py", line 119, in single_identify
  File "calibre\ebooks\metadata\sources\identify.py", line 499, in identify
  File "calibre\ebooks\metadata\sources\identify.py", line 367, in merge_identify_results
  File "calibre\ebooks\metadata\sources\identify.py", line 155, in finalize
  File "calibre\ebooks\metadata\sources\identify.py", line 249, in merge_isbn_results
  File "calibre\ebooks\metadata\sources\identify.py", line 351, in merge
  File "calibre\ebooks\metadata\book\base.py", line 169, in __getattribute__
AttributeError: Metadata object has no attribute named: 'identify'

Plugin 国家图书馆ISBN插件 failed
Traceback (most recent call last):
File "/usr/lib/calibre/calibre/ebooks/metadata/sources/identify.py", line 47, in run
self.plugin.identify(self.log, self.rq, self.abort, **self.kwargs)
File "calibre_plugins.dummy4.init", line 446, in identify
metadatas = title2metadata(title, log, result_queue, self.clean_downloaded_metadata,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "calibre_plugins.dummy4.init", line 96, in title2metadata
dynamic_url = get_dynamic_url(log)
^^^^^^^^^^^^^^^^^^^^
File "calibre_plugins.dummy4.init", line 83, in get_dynamic_url
response_text = response.read().decode('utf-8')
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position 1: invalid start byte

当单一isbn对应多本书籍时，无法下载元数据

有点疑惑，这个不需要登录国图账号获取cookie吗？？

Calibre版本7.2安装失败

calibre, version 7.2.0
错误：未处理的异常: InvalidPlugin:The plugin in 'D:\Download\NLCISBNPlugin-1.0.0.zip' is invalid. It does not contain a top-level init.py file

calibre 7.2 embedded-python: True
Windows-10-10.0.22635-SP0 Windows ('64bit', 'WindowsPE')
('Windows', '10', '10.0.22635')
Python 3.11.5
Windows: ('10', '10.0.22635', 'SP0', 'Multiprocessor Free')
Interface language: zh_CN
Traceback (most recent call last):
File "calibre\gui2\preferences\plugins.py", line 325, in add_plugin
File "calibre\customize\ui.py", line 513, in add_plugin
File "calibre\customize\ui.py", line 64, in load_plugin
File "calibre\customize\zipplugin.py", line 298, in load
File "calibre\customize\zipplugin.py", line 393, in _locate_code
calibre.customize.InvalidPlugin: The plugin in 'D:\Download\NLCISBNPlugin-1.0.0.zip' is invalid. It does not contain a top-level init.py file

因http error 502无法匹配元数据

calibre, version 7.10.0
错误：无匹配项:

无法为你找到书籍元数据。请尝试扩大搜索范围。比如：只使用作者的姓氏或只包含书名中具有特征性的词。

要查看完整的日志，请点击“显示详情”。

Running identify query with parameters:
{'title': '**古典文学基本丛书002 杜诗详注（第四册）（[唐]杜甫撰，[清]仇兆鳌注，中华书局1999）', 'authors': ['[唐]杜甫撰，[清]仇兆鳌注'], 'identifiers': {}, 'timeout': 30}
Using plugins: 国家图书馆ISBN插件 (1, 2, 1)
The log from individual plugins is below

****************************** 国家图书馆ISBN插件 (1, 2, 1) ******************************
Found 0 results
Downloading from 国家图书馆ISBN插件 took 1.6326899528503418
未检测到isbn。
正在根据书名获取metadata...
Plugin 国家图书馆ISBN插件 failed
Traceback (most recent call last):
File "calibre\ebooks\metadata\sources\identify.py", line 47, in run
File "calibre_plugins.dummy1.init", line 446, in identify
File "calibre_plugins.dummy1.init", line 102, in title2metadata
File "urllib\request.py", line 216, in urlopen
File "urllib\request.py", line 525, in open
File "urllib\request.py", line 634, in http_response
File "urllib\request.py", line 563, in error
File "urllib\request.py", line 496, in _call_chain
File "urllib\request.py", line 643, in http_error_default
urllib.error.HTTPError: HTTP Error 502: Bad Gateway

The identify phase took 1.80 seconds
The longest time (1.632690) was taken by: 国家图书馆ISBN插件
Merging results from different sources
We have 0 merged results, merging took: 0.00 seconds