
doiiarx / nlcisbnplugin


A calibre source/metadata plugin based on ISBN lookup at the National Library of China (NLC)

Home Page: https://notion.doiiars.com/

License: Apache License 2.0

Python 100.00%
calibre-plugin isbn metadata spider

nlcisbnplugin's Introduction

Hi there, this is Doiiars 👋

Feel free to contact me for custom programs and web crawlers.


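  • Skilled in web crawling, image processing (CV), and document processing
  • Experience with object detection projects
  • Experience with GUI design
  • Experience with web design
  • Experience with ChatGPT projects
  • Experience with Docker
  • Experience with Raspberry Pi and microcontrollers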


❤ Donation

If you find my project helpful, please consider sponsoring me; it motivates me to put more time into maintenance and development. Your support is greatly appreciated.


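Your stars and donations are what keep me maintaining this project over the long term, and I sincerely thank every supporter. "Every time you spend money, you are casting a vote for the kind of world you want." Recommending this project to more people is another way to help: the more users it has, the more motivation there is to keep updating it.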

nlcisbnplugin's People

Contributors

doiiarx, fossabot

nlcisbnplugin's Issues

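Metadata entry improvements

I hope the developer can improve the metadata that gets recorded:
1. Simplify the title: drop the "document type", "English title", "author", and similar items that come with the raw data.
2. Clean up the author field: remove the character "著" (a role marker meaning "written by"), because it affects the directory names in the calibre library.
3. Clean up the tags field: move the publisher, publication year, and similar information into calibre's own publisher and publication date fields.
4. Clean up the identifiers field: the nlchash value does not seem to be useful, so please remove it.
5. Also, improve fuzzy title search: without an ISBN there are essentially no results.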
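
To make requests 2 and 3 concrete, here is a minimal sketch of the kind of cleanup being asked for. It is not the plugin's actual code: the function names `clean_author` and `split_tags`, the suffix list, and the assumption that the publisher tag ends in "出版社" and the year tag is a bare four-digit string are all hypothetical.

```python
import re

# Hypothetical role markers to strip from author strings. Longest first,
# so that "编著" is removed as a whole rather than leaving "编" behind.
ROLE_SUFFIXES = ("编著", "著")


def clean_author(raw: str) -> str:
    """Strip a trailing role marker such as 著 from an author string."""
    name = raw.strip()
    for suffix in ROLE_SUFFIXES:
        # Only strip when something is left over, so a bare "著" is kept.
        if name.endswith(suffix) and len(name) > len(suffix):
            return name[: -len(suffix)].strip()
    return name


def split_tags(tags):
    """Pull a publisher and a year out of a tag list (assumed tag formats).

    Returns (publisher, year, remaining_tags). The first tag ending in
    "出版社" is treated as the publisher and the first four-digit tag as
    the publication year; everything else stays a tag.
    """
    publisher, year, rest = None, None, []
    for tag in tags:
        if publisher is None and tag.endswith("出版社"):
            publisher = tag
        elif year is None and re.fullmatch(r"\d{4}", tag):
            year = tag
        else:
            rest.append(tag)
    return publisher, year, rest
```

In a calibre source plugin, the extracted values would then go into the publisher and publication date fields of the metadata object instead of its tags; how the plugin builds that object is not shown in this page.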

Metadata download reports "download failed"

calibre, version 7.3.0
Error: Download failed: Failed to download metadata. Click "Show details" to see more information.

Traceback (most recent call last):
  File "calibre\utils\ipc\simple_worker.py", line 303, in main
  File "calibre\ebooks\metadata\sources\worker.py", line 58, in wrapper
  File "calibre\ebooks\metadata\sources\worker.py", line 119, in single_identify
  File "calibre\ebooks\metadata\sources\identify.py", line 499, in identify
  File "calibre\ebooks\metadata\sources\identify.py", line 367, in merge_identify_results
  File "calibre\ebooks\metadata\sources\identify.py", line 155, in finalize
  File "calibre\ebooks\metadata\sources\identify.py", line 249, in merge_isbn_results
  File "calibre\ebooks\metadata\sources\identify.py", line 351, in merge
  File "calibre\ebooks\metadata\book\base.py", line 169, in __getattribute__
AttributeError: Metadata object has no attribute named: 'identify'

Download fails because of a character-encoding error

Plugin 国家图书馆ISBN插件 failed
Traceback (most recent call last):
  File "/usr/lib/calibre/calibre/ebooks/metadata/sources/identify.py", line 47, in run
    self.plugin.identify(self.log, self.rq, self.abort, **self.kwargs)
  File "calibre_plugins.dummy4.__init__", line 446, in identify
    metadatas = title2metadata(title, log, result_queue, self.clean_downloaded_metadata,
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "calibre_plugins.dummy4.__init__", line 96, in title2metadata
    dynamic_url = get_dynamic_url(log)
                  ^^^^^^^^^^^^^^^^^^^^
  File "calibre_plugins.dummy4.__init__", line 83, in get_dynamic_url
    response_text = response.read().decode('utf-8')
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position 1: invalid start byte
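
A gzip stream begins with the two bytes 0x1f 0x8b, so a 0x8b at position 1 is consistent with the server having returned a gzip-compressed body that was then decoded as UTF-8 directly. That diagnosis is a guess from the traceback, not something confirmed in the report. Under that assumption, a defensive decode step could look like the sketch below; `decode_body` is a hypothetical helper, not a function from the plugin.

```python
import gzip

# Magic number that starts every gzip stream.
GZIP_MAGIC = b"\x1f\x8b"


def decode_body(raw: bytes, encoding: str = "utf-8") -> str:
    """Decode an HTTP response body, decompressing it first if it is gzip."""
    if raw[:2] == GZIP_MAGIC:
        raw = gzip.decompress(raw)
    # errors="replace" keeps a stray bad byte from aborting the whole lookup.
    return raw.decode(encoding, errors="replace")
```

With a helper like this, the failing line would become something like `response_text = decode_body(response.read())`.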

Installation fails on calibre 7.2

calibre, version 7.2.0
Error: Unhandled exception: InvalidPlugin: The plugin in 'D:\Download\NLCISBNPlugin-1.0.0.zip' is invalid. It does not contain a top-level __init__.py file

calibre 7.2 embedded-python: True
Windows-10-10.0.22635-SP0 Windows ('64bit', 'WindowsPE')
('Windows', '10', '10.0.22635')
Python 3.11.5
Windows: ('10', '10.0.22635', 'SP0', 'Multiprocessor Free')
Interface language: zh_CN
Traceback (most recent call last):
  File "calibre\gui2\preferences\plugins.py", line 325, in add_plugin
  File "calibre\customize\ui.py", line 513, in add_plugin
  File "calibre\customize\ui.py", line 64, in load_plugin
  File "calibre\customize\zipplugin.py", line 298, in load
  File "calibre\customize\zipplugin.py", line 393, in _locate_code
calibre.customize.InvalidPlugin: The plugin in 'D:\Download\NLCISBNPlugin-1.0.0.zip' is invalid. It does not contain a top-level __init__.py file
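
calibre expects the plugin's `__init__.py` to sit at the root of the zip archive. One common way to end up with this error is installing an archive whose files are nested inside a folder, as in a source-code download; whether that is what happened here is an assumption. The sketch below checks an archive before installing it; `has_top_level_init` is a hypothetical helper name.

```python
import zipfile


def has_top_level_init(zip_path_or_file) -> bool:
    """Return True if the archive has __init__.py at its root.

    Accepts a filesystem path or a file-like object, as zipfile.ZipFile does.
    """
    with zipfile.ZipFile(zip_path_or_file) as zf:
        # namelist() returns full member paths, so a nested
        # "folder/__init__.py" does not match.
        return "__init__.py" in zf.namelist()
```

If the check returns False for a downloaded archive, re-zipping the plugin files so that `__init__.py` is at the top level is the usual remedy.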

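Metadata cannot be matched because of HTTP Error 502

calibre, version 7.10.0
Error: No matches found:

Failed to find any books that match your search. Try making the search less specific. For example, use only the author's last name or only a distinctive word from the title.

To see the full log, click "Show details".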

Running identify query with parameters:
{'title': '**古典文学基本丛书002 杜诗详注(第四册)([唐]杜甫撰,[清]仇兆鳌注,中华书局1999)', 'authors': ['[唐]杜甫撰,[清]仇兆鳌注'], 'identifiers': {}, 'timeout': 30}
Using plugins: 国家图书馆ISBN插件 (1, 2, 1)
The log from individual plugins is below

****************************** 国家图书馆ISBN插件 (1, 2, 1) ******************************
Found 0 results
Downloading from 国家图书馆ISBN插件 took 1.6326899528503418
No ISBN detected.
Fetching metadata by title...
Plugin 国家图书馆ISBN插件 failed
Traceback (most recent call last):
  File "calibre\ebooks\metadata\sources\identify.py", line 47, in run
  File "calibre_plugins.dummy1.__init__", line 446, in identify
  File "calibre_plugins.dummy1.__init__", line 102, in title2metadata
  File "urllib\request.py", line 216, in urlopen
  File "urllib\request.py", line 525, in open
  File "urllib\request.py", line 634, in http_response
  File "urllib\request.py", line 563, in error
  File "urllib\request.py", line 496, in _call_chain
  File "urllib\request.py", line 643, in http_error_default
urllib.error.HTTPError: HTTP Error 502: Bad Gateway


The identify phase took 1.80 seconds
The longest time (1.632690) was taken by: 国家图书馆ISBN插件
Merging results from different sources
We have 0 merged results, merging took: 0.00 seconds
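
A 502 comes from the server side and is often transient, so one possible mitigation is to retry the request a few times before giving up. The sketch below is only an illustration built on the same `urllib` calls that appear in the traceback. The function name `fetch_with_retry`, the retry count, the backoff values, and the set of status codes treated as retryable are assumptions, not the plugin's behavior.

```python
import time
import urllib.error
import urllib.request

# Status codes treated as transient in this sketch (an assumption).
RETRYABLE = (502, 503, 504)


def fetch_with_retry(url, retries=3, backoff=1.0, timeout=30,
                     opener=urllib.request.urlopen, sleep=time.sleep):
    """Fetch a URL, retrying with exponential backoff on gateway errors.

    `opener` and `sleep` are injectable so the function can be exercised
    without network access or real delays.
    """
    for attempt in range(retries + 1):
        try:
            with opener(url, timeout=timeout) as resp:
                return resp.read()
        except urllib.error.HTTPError as err:
            # Give up on non-transient errors or once retries are exhausted.
            if err.code not in RETRYABLE or attempt == retries:
                raise
            sleep(backoff * 2 ** attempt)
```

Keeping the total wait below the 30-second `timeout` shown in the identify query would matter in practice, since calibre runs each source under that limit.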
