magicdawn / weread-spy
268 stars · 5 watchers · 54 forks · 15.99 MB

weread-spy: generate ePub e-books from the web version of WeRead (微信读书)

License: MIT License

JavaScript 1.07% TypeScript 83.21% HTML 6.22% CSS 9.40% Shell 0.10%

weread-spy's People

Contributors

magicdawn

weread-spy's Issues

Download error

It worked fine yesterday, but today it keeps failing with TimeoutError: Waiting failed: 30000ms exceeded. Could you take a look at where the problem is?
The current URL looks like a book:
[url]: https://weread.qq.com/web/reader/ca832f80813ab8065g015794?
[title]: 博尔赫斯教授:二十五堂文学课(博尔赫斯全集)-豪尔赫·路易斯·博尔赫斯 马蒂恩·阿里亚斯 马蒂恩·哈迪斯-微信读书
? Download: Yes
file:///D:/pnpm/globalDir/5/.pnpm/[email protected]/node_modules/puppeteer-core/lib/esm/puppeteer/common/WaitTask.js:56
void this.terminate(new TimeoutError(`Waiting failed: ${options.timeout}ms exceeded`));
^

TimeoutError: Waiting failed: 30000ms exceeded
at Timeout.<anonymous> (file:///D:/pnpm/globalDir/5/.pnpm/[email protected]/node_modules/puppeteer-core/lib/esm/puppeteer/common/WaitTask.js:56:37)
at listOnTimeout (node:internal/timers:573:17)
at process.processTimers (node:internal/timers:514:7)

Incomplete book download

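Some books have very long chapters. Only the first page of each chapter is downloaded; the rest only appears after clicking "next page".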

Stuck at the content-processing step

The current URL looks like a book:
   [url]: https://weread.qq.com/web/reader/c8732110813ab7f1ag015eaa?
 [title]: 非亲非故-索耳-微信读书
? Download:  Yes
  weread-spy:download collected chapter id=2 +0ms
  weread-spy:download collected chapter id=3 +3s
  weread-spy:download collected chapter id=4 +4s
  weread-spy:download collected chapter id=5 +4s
  weread-spy:download collected chapter id=6 +5s
  weread-spy:download collected chapter id=7 +2s
  weread-spy:download collected chapter id=8 +4s
  weread-spy:download collected chapter id=9 +2s
  weread-spy:download collected chapter id=10 +2s
  weread-spy:download collected chapter id=11 +1s
  weread-spy:download book id = 3300062985 url = https://weread.qq.com/web/reader/c8732110813ab7f1ag015eaakc81322c012c81e728d9d180 +106ms
  weread-spy:download downloaded to /Users/xx/Library/Application Support/weread-spy/books/3300062985-非亲非故.json +0ms
  weread-spy:one -------------------- download complete -------------------- +0ms
  weread-spy:utils:epub epubgen 3300062985 -> /Users/xx/prj/wxread/非亲非故.epub +0ms
  weread-spy:utils:epub-img cleaning: remove imgs.json /Users/xx/Library/Application Support/weread-spy/books/3300062985-非亲非故/imgs.json +0ms
  weread-spy:utils:epub-img cleaning: remove imgs dir /Users/xx/Library/Application Support/weread-spy/books/3300062985-非亲非故/imgs +3ms
  weread-spy:utils:epub-img imgSrcs collected, length = 19, unique length = 19 +2s
  weread-spy:utils:epub-img add cover url = https://weread-1258476243.file.myqcloud.com/weread/cover/47/cpplatform_rqpr84e2rd5ytyqqaj4dzd/s_cpplatform_rqpr84e2rd5ytyqqaj4dzd1688101138.jpg +1ms
  weread-spy:utils:epub-img download img complete +2s
2023-08-09T06:35:31.278Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=7
2023-08-09T06:35:31.279Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=2
2023-08-09T06:35:31.280Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=5
2023-08-09T06:35:31.283Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=8
2023-08-09T06:35:31.284Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=4
2023-08-09T06:35:31.285Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=6
2023-08-09T06:35:32.663Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=9
2023-08-09T06:35:31.379Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=3
2023-08-09T06:35:32.865Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=10
2023-08-09T06:35:33.148Z weread-spy:utils:processContent processContent for title=非亲非故 chapterUid=11

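After that it hangs and never continues processing. One peculiarity of this book is that its chapters are not split into paragraphs: each chapter is written as a single paragraph. I don't know whether that is related.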

Heap out of memory when a book has too many chapters

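This happens when I download 《罗辑思维荐书集锦(套装共24册)》 (尤瓦尔·赫拉利 et al.) | 微信读书
https://weread.qq.com/book-detail?type=1&senderVid=320541134&v=f903217072525e4bf9018be&wtheme=white&wfrom=app&wvid=320541134&scene=bottomSheetShare
When the chapter id reaches 714, it fails with a heap out-of-memory error. I don't know whether every book with a lot of content fails this way.
The error output is as follows: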
<--- Last few GCs --->

[19003:0x55740cc6be90] 1266761 ms: Scavenge (reduce) 2046.6 (2068.9) -> 2046.4 (2069.4) MB, 188.2 / 0.0 ms (average mu = 0.973, current mu = 0.938) allocation failure;
[19003:0x55740cc6be90] 1267409 ms: Mark-sweep (reduce) 2047.0 (2069.4) -> 2046.8 (2070.2) MB, 645.4 / 0.0 ms (+ 9.0 ms in 43 steps since start of marking, biggest step 4.4 ms, walltime since start of marking 1429 ms) (average mu = 0.951, current mu = 0.

<--- JS stacktrace --->

FATAL ERROR: Reached heap limit Allocation failed - JavaScript heap out of memory
1: 0x7fe3a95f103c node::Abort() [/lib/x86_64-linux-gnu/libnode.so.108]
2: 0x7fe3a94e72f8 [/lib/x86_64-linux-gnu/libnode.so.108]
3: 0x7fe3a996207a v8::Utils::ReportOOMFailure(v8::internal::Isolate*, char const*, bool) [/lib/x86_64-linux-gnu/libnode.so.108]
4: 0x7fe3a9962437 v8::internal::V8::FatalProcessOutOfMemory(v8::internal::Isolate*, char const*, bool) [/lib/x86_64-linux-gnu/libnode.so.108]
5: 0x7fe3a9b4af19 [/lib/x86_64-linux-gnu/libnode.so.108]
6: 0x7fe3a9b610e3 v8::internal::Heap::CollectGarbage(v8::internal::AllocationSpace, v8::internal::GarbageCollectionReason, v8::GCCallbackFlags) [/lib/x86_64-linux-gnu/libnode.so.108]
7: 0x7fe3a9b3c236 v8::internal::HeapAllocator::AllocateRawWithLightRetrySlowPath(int, v8::internal::AllocationType, v8::internal::AllocationOrigin, v8::internal::AllocationAlignment) [/lib/x86_64-linux-gnu/libnode.so.108]
8: 0x7fe3a9b3d5e7 v8::internal::HeapAllocator::AllocateRawWithRetryOrFailSlowPath(int, v8::internal::AllocationType, v8::internal::AllocationOrigin, v8::internal::AllocationAlignment) [/lib/x86_64-linux-gnu/libnode.so.108]
9: 0x7fe3a9b1c526 v8::internal::Factory::AllocateRaw(int, v8::internal::AllocationType, v8::internal::AllocationAlignment) [/lib/x86_64-linux-gnu/libnode.so.108]
10: 0x7fe3a9b14568 v8::internal::FactoryBase<v8::internal::Factory>::AllocateRawWithImmortalMap(int, v8::internal::AllocationType, v8::internal::Map, v8::internal::AllocationAlignment) [/lib/x86_64-linux-gnu/libnode.so.108]
11: 0x7fe3a9b169fe v8::internal::FactoryBase<v8::internal::Factory>::NewRawOneByteString(int, v8::internal::AllocationType) [/lib/x86_64-linux-gnu/libnode.so.108]
12: 0x7fe3a9f8317a v8::internal::IncrementalStringBuilder::Extend() [/lib/x86_64-linux-gnu/libnode.so.108]
13: 0x7fe3a9f83346 v8::internal::IncrementalStringBuilder::AppendString(v8::internal::Handle<v8::internal::String>) [/lib/x86_64-linux-gnu/libnode.so.108]
14: 0x7fe3a9f3721b v8::internal::Runtime_RegExpReplaceRT(int, unsigned long*, v8::internal::Isolate*) [/lib/x86_64-linux-gnu/libnode.so.108]
15: 0x7fe3a98723b9 [/lib/x86_64-linux-gnu/libnode.so.108]
Aborted (core dumped)

Screenshot 2023-04-25 14-32-37
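
One possible mitigation for a book with hundreds of chapters is to bound how many chapters are processed at the same time. The sketch below is a generic concurrency limiter. The name `mapLimit`, and the idea that chapter processing currently runs in parallel, are assumptions rather than code taken from weread-spy, and it has not been tried against this book.

```typescript
// Hypothetical helper: run an async worker over items with at most `limit` calls in flight.
async function mapLimit<T, R>(
  items: readonly T[],
  limit: number,
  worker: (item: T, index: number) => Promise<R>,
): Promise<R[]> {
  if (!Number.isInteger(limit) || limit < 1) {
    throw new RangeError('limit must be a positive integer')
  }
  const results: R[] = new Array(items.length)
  let next = 0
  // Each runner repeatedly claims the next unprocessed index until the list is exhausted.
  const runner = async (): Promise<void> => {
    while (next < items.length) {
      const i = next++
      results[i] = await worker(items[i], i)
    }
  }
  const runners = Array.from({ length: Math.min(limit, items.length) }, () => runner())
  await Promise.all(runners)
  return results
}
```

A caller could then write something like `await mapLimit(chapters, 4, processChapter)`, where `chapters` and `processChapter` are placeholders for whatever the tool uses internally. Results come back in input order regardless of completion order.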

Internal Error: Failed to launch the browser process

Running weread-spy fails with an error saying the browser cannot be launched. It is probably a puppeteer problem, but I don't know how to fix it.

Installed via pnpm
pnpm v8.4.0
npm v9.5.1
nodejs v18.16.0
Windows10

Error output

Internal Error: Failed to launch the browser process! undefined
[16308:12068:0508/024016.443:FATAL:chrome_resource_bundle_helper.cc(107)] Check failed: !actual_locale.empty(). Locale could not be found for
Backtrace:
        IsSandboxedProcess [0x00007FFCC20A1062+998194]
        IsSandboxedProcess [0x00007FFCC2E38402+15249106]
        IsSandboxedProcess [0x00007FFCC21311D2+1588386]
        IsSandboxedProcess [0x00007FFCC2132250+1592608]
        IsSandboxedProcess [0x00007FFCC20469A8+627832]
        ChromeMain [0x00007FFCBE7F1E95+3573]
        CrashForExceptionInNonABICompliantCodeRange [0x00007FFCC19FCAC2+24761666]
        CrashForExceptionInNonABICompliantCodeRange [0x00007FFCC19FC764+24760804]
        CrashForExceptionInNonABICompliantCodeRange [0x00007FFCC19FAF5F+24754655]
        CrashForExceptionInNonABICompliantCodeRange [0x00007FFCC19FB21E+24755358]
        ChromeMain [0x00007FFCBE7F133F+671]
        GetPakFileHashes [0x00007FF722EB29FF+6655]
        GetPakFileHashes [0x00007FF722EB1A96+2710]
        GetHandleVerifier [0x00007FF7230235B2+1030434]
        BaseThreadInitThunk [0x00007FFD5E787604+20]
        RtlUserThreadStart [0x00007FFD5F9426A1+33]
Crash keys:
  "reentry_guard_tls_slot" = "unused"
  "variations" = "db59f83a-3f4a17df,8bccc03b-3f4a17df,fabf21f1-3f4a17df,272b4158-3f4a17df,2510663e-73703436,"
  "num-experiments" = "5"
  "switch-27" = "about:blank"
  "switch-26" = "--remote-debugging-port=0"
  "switch-25" = "--user-data-dir=C:\Users\white\AppData\Local\weread-spy\Data\ppt"
  "switch-24" = "--use-mock-keychain"
  "switch-23" = "--password-store=basic"
  "switch-22" = "--no-first-run"
  "switch-21" = "--metrics-recording-only"
  "switch-20" = "--force-color-profile=srgb"
  "switch-19" = "--export-tagged-pdf"
  "switch-18" = "--enable-blink-features=IdleDetection"
  "switch-17" = "--disable-sync"
  "switch-16" = "--disable-renderer-backgrounding"
  "switch-15" = "--disable-prompt-on-repost"
  "switch-14" = "--disable-popup-blocking"
  "switch-13" = "--disable-ipc-flooding-protection"
  "switch-12" = "--disable-hang-monitor"
  "switch-11" = "--disable-extensions"
  "switch-10" = "--disable-dev-shm-usage"
  "switch-9" = "--disable-default-apps"
  "switch-8" = "--disable-component-update"
  "switch-7" = "--disable-component-extensions-with-background-pages"
  "switch-6" = "--disable-client-side-phishing-detection"
  "switch-5" = "--disable-breakpad"
  "switch-4" = "--disable-backgrounding-occluded-windows"
  "switch-3" = "--disable-background-timer-throttling"
  "switch-2" = "--disable-background-networking"
  "switch-1" = "--allow-pre-commit-input"
  "num-switches" = "29"
  "commandline-disabled-feature-5" = "OptimizationHints"
  "commandline-disabled-feature-4" = "MediaRouter"
  "commandline-disabled-feature-3" = "AcceptCHFrame"
  "commandline-disabled-feature-2" = "BackForwardCache"
  "commandline-disabled-feature-1" = "Translate"
  "commandline-enabled-feature-1" = "NetworkServiceInProcess2"



TROUBLESHOOTING: https://pptr.dev/troubleshooting

    at ChildProcess.onClose (C:\Users\white\AppData\Local\pnpm\global\5\.pnpm\registry.npmmirror.com+@[email protected]\node_modules\@puppeteer\browsers\lib\cjs\launch.js:262:24)
    at ChildProcess.emit (node:events:525:35)
    at ChildProcess._handle.onexit (node:internal/child_process:291:12)

Error: Execution context was destroyed, most likely because of a navigation.

/usr/local/lib/node_modules/weread-spy/node_modules/puppeteer-core/lib/cjs/puppeteer/common/ExecutionContext.js:315
throw new Error('Execution context was destroyed, most likely because of a navigation.');
^

Error: Execution context was destroyed, most likely because of a navigation.
at rewriteError (/usr/local/lib/node_modules/weread-spy/node_modules/puppeteer-core/lib/cjs/puppeteer/common/ExecutionContext.js:315:15)
at runMicrotasks (<anonymous>)
at processTicksAndRejections (node:internal/process/task_queues:96:5)
at async ExecutionContext._ExecutionContext_evaluate (/usr/local/lib/node_modules/weread-spy/node_modules/puppeteer-core/lib/cjs/puppeteer/common/ExecutionContext.js:256:56)
at async CDPJSHandle.evaluateHandle (/usr/local/lib/node_modules/weread-spy/node_modules/puppeteer-core/lib/cjs/puppeteer/common/JSHandle.js:65:16)
at async Function.queryOne (/usr/local/lib/node_modules/weread-spy/node_modules/puppeteer-core/lib/cjs/puppeteer/common/QueryHandler.js:87:24)
at async CDPElementHandle.$ (/usr/local/lib/node_modules/weread-spy/node_modules/puppeteer-core/lib/cjs/puppeteer/common/ElementHandle.js:101:17)
at async CDPElementHandle.$eval (/usr/local/lib/node_modules/weread-spy/node_modules/puppeteer-core/lib/cjs/puppeteer/common/ElementHandle.js:108:31)
at async n (/usr/local/lib/node_modules/weread-spy/dist/bin.js:10:1461)
at async ge (/usr/local/lib/node_modules/weread-spy/dist/bin.js:10:2543)

Book link: https://weread.qq.com/web/reader/5913240072122772591eb1fk43e327b025143ec517d680b

TypeError: Cannot read properties of null (reading '$store')

Both weread-spy 0.3.0 and 0.4.0 fail with the error below. I tried upgrading puppeteer-core to the latest version, but the problem persists.

Error: Evaluation failed: TypeError: Cannot read properties of null (reading '$store')
at pptr://puppeteer_evaluation_script:2:34
at ExecutionContext._ExecutionContext_evaluate (/private/tmp/test/weread-spy/node_modules/.pnpm/[email protected][email protected]/node_modules/puppeteer-core/src/common/ExecutionContext.ts:319:13)
at processTicksAndRejections (node:internal/process/task_queues:95:5)
at async ExecutionContext.evaluate (/private/tmp/test/weread-spy/node_modules/.pnpm/[email protected][email protected]/node_modules/puppeteer-core/src/common/ExecutionContext.ts:181:12)
at async CDPJSHandle.evaluate (/private/tmp/test/weread-spy/node_modules/.pnpm/[email protected][email protected]/node_modules/puppeteer-core/src/common/JSHandle.ts:74:12)
at async CDPElementHandle.$eval (/private/tmp/test/weread-spy/node_modules/.pnpm/[email protected][email protected]/node_modules/puppeteer-core/src/common/ElementHandle.ts:145:20)
at async waitCondition (/private/tmp/test/weread-spy/src/commands/one.ts:102:12)
at async decideDownload (/private/tmp/test/weread-spy/src/commands/one.ts:111:3)
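
The message suggests that the page-side script dereferences an object that is still null when the wait condition runs. A minimal sketch of a defensive read follows. It assumes the store is reached through a Vue 2 style `__vue__` property on an element, which is a guess about how weread-spy works; `VueLikeElement`, `readStoreState` and `isStoreReady` are hypothetical names.

```typescript
// Assumed shape: a Vue 2 app attaches its instance to the root element as `__vue__`.
// Whether weread-spy reads the store this way is an assumption.
interface VueLikeElement {
  __vue__?: { $store?: { state?: unknown } } | null
}

// Returns the store state, or undefined while the app has not mounted yet.
function readStoreState(el: VueLikeElement | null | undefined): unknown {
  return el?.__vue__?.$store?.state
}

// A wait condition can report "not ready" instead of throwing a TypeError.
function isStoreReady(el: VueLikeElement | null | undefined): boolean {
  return readStoreState(el) !== undefined
}
```

With a guard like this, a polling wait would keep retrying until the store exists (or time out) rather than aborting on the first null.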

Download error: epubcheck reports "Duplicate ID" and "could not be found" errors

wro 18:08:49
ua = Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/118.0.0.0 Safari/537.36

The current URL looks like a book:
[url]: https://weread.qq.com/web/reader/a72320c072289166a7200ffkc81322c012c81e728d9d180
[title]: AI 3.0-梅拉妮·米歇尔-微信读书
? Download: Yes
weread-spy:download collected chapter id=2 +0ms
weread-spy:download collected chapter id=3 +478ms
weread-spy:download collected chapter id=4 +582ms
weread-spy:download collected chapter id=5 +1s
weread-spy:download collected chapter id=6 +1s
weread-spy:download collected chapter id=7 +1s
weread-spy:download collected chapter id=8 +519ms
weread-spy:download collected chapter id=9 +2s
weread-spy:download collected chapter id=10 +855ms
weread-spy:download collected chapter id=11 +2s
weread-spy:download collected chapter id=12 +508ms
weread-spy:download collected chapter id=13 +1s
weread-spy:download collected chapter id=14 +1s
weread-spy:download collected chapter id=15 +1s
weread-spy:download collected chapter id=16 +1s
weread-spy:download collected chapter id=17 +494ms
weread-spy:download collected chapter id=18 +1s
weread-spy:download collected chapter id=19 +1s
weread-spy:download collected chapter id=20 +1s
weread-spy:download collected chapter id=21 +514ms
weread-spy:download collected chapter id=22 +1s
weread-spy:download collected chapter id=23 +1s
weread-spy:download collected chapter id=24 +2s
weread-spy:download collected chapter id=25 +519ms
weread-spy:download collected chapter id=26 +1s
weread-spy:download collected chapter id=27 +1s
weread-spy:download collected chapter id=28 +1s
weread-spy:download collected chapter id=29 +566ms
weread-spy:download collected chapter id=30 +3s
weread-spy:download collected chapter id=31 +459ms
weread-spy:download collected chapter id=32 +888ms
weread-spy:download book id = 36213094 url = https://weread.qq.com/web/reader/a72320c072289166a7200ffkc81322c012c81e728d9d180 +135ms
weread-spy:download downloaded to /Users/lion/Library/Application Support/weread-spy/books/36213094-AI 3.0.json +0ms
weread-spy:one -------------------- download complete -------------------- +0ms
weread-spy:utils:epub epubgen 36213094 -> /Users/lion/AI 3.0.epub +0ms
weread-spy:utils:epub-img cleaning: remove imgs.json /Users/lion/Library/Application Support/weread-spy/books/36213094-AI 3.0/imgs.json +0ms
weread-spy:utils:epub-img cleaning: remove imgs dir /Users/lion/Library/Application Support/weread-spy/books/36213094-AI 3.0/imgs +1ms
weread-spy:utils:epub-img imgSrcs collected, length = 58, unique length = 58 +1s
weread-spy:utils:epub-img add cover url = https://wfqqreader-1252317822.image.myqcloud.com/cover/94/36213094/t9_36213094.jpg +0ms
weread-spy:utils:epub-img download img complete +2s
2023-10-29T10:11:18.478Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=6
2023-10-29T10:11:18.494Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=7
2023-10-29T10:11:18.495Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=5
2023-10-29T10:11:18.509Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=8
2023-10-29T10:11:18.543Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=2
2023-10-29T10:11:18.580Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=4
2023-10-29T10:11:18.591Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=3
2023-10-29T10:11:18.709Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=9
2023-10-29T10:11:18.819Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=11
2023-10-29T10:11:18.819Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=10
2023-10-29T10:11:19.157Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=13
2023-10-29T10:11:19.183Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=14
2023-10-29T10:11:19.303Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=15
2023-10-29T10:11:19.391Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=16
2023-10-29T10:11:19.587Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=17
2023-10-29T10:11:19.639Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=18
2023-10-29T10:11:19.651Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=19
2023-10-29T10:11:19.787Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=20
2023-10-29T10:11:19.930Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=21
2023-10-29T10:11:19.975Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=23
2023-10-29T10:11:19.976Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=22
2023-10-29T10:11:20.051Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=24
2023-10-29T10:11:20.228Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=28
2023-10-29T10:11:20.382Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=29
2023-10-29T10:11:20.415Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=30
2023-10-29T10:11:20.478Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=31
2023-10-29T10:11:20.497Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=32
2023-10-29T10:11:20.086Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=12
2023-10-29T10:11:20.142Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=25
2023-10-29T10:11:20.201Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=26
2023-10-29T10:11:20.832Z weread-spy:utils:processContent processContent for title=AI 3.0 chapterUid=27
weread-spy:utils:epub processContent cost 3753 ms +7s
weread-spy:utils:epub epub created: /Users/lion/AI 3.0.epub +1s
weread-spy:utils:epubcheck [exec]: java -jar '/Users/lion/Library/pnpm/global/5/.pnpm/[email protected]/node_modules/epubcheck-assets/assets/epubcheck-5.1.0/epubcheck.jar' '/Users/lion/AI 3.0.epub' +0ms
Validating using EPUB version 3.3 rules.
ERROR(RSC-005): ./AI 3.0.epub/OEBPS/chapter-9.xhtml(149,320): Error while parsing file: Duplicate ID "wz_2_19"
ERROR(RSC-005): ./AI 3.0.epub/OEBPS/chapter-9.xhtml(150,91): Error while parsing file: Duplicate ID "wz_2_19"
ERROR(RSC-005): ./AI 3.0.epub/OEBPS/chapter-19.xhtml(150,316): Error while parsing file: Duplicate ID "wz_10_25"
ERROR(RSC-005): ./AI 3.0.epub/OEBPS/chapter-19.xhtml(151,93): Error while parsing file: Duplicate ID "wz_10_25"
ERROR(RSC-005): ./AI 3.0.epub/OEBPS/chapter-23.xhtml(130,385): Error while parsing file: Duplicate ID "jzyy_0_274"
ERROR(RSC-005): ./AI 3.0.epub/OEBPS/chapter-23.xhtml(131,95): Error while parsing file: Duplicate ID "jzyy_0_274"
ERROR(RSC-005): ./AI 3.0.epub/OEBPS/chapter-28.xhtml(150,285): Error while parsing file: Duplicate ID "wz_17_22"
ERROR(RSC-005): ./AI 3.0.epub/OEBPS/chapter-28.xhtml(151,93): Error while parsing file: Duplicate ID "wz_17_22"
ERROR(RSC-007): ./AI 3.0.epub/OEBPS/chapter-9.xhtml(150,91): Referenced resource "Text/postscript3.xhtml" could not be found in the EPUB.
ERROR(RSC-007): ./AI 3.0.epub/OEBPS/chapter-19.xhtml(151,93): Referenced resource "Text/postscript3.xhtml" could not be found in the EPUB.
ERROR(RSC-007): ./AI 3.0.epub/OEBPS/chapter-23.xhtml(131,95): Referenced resource "Text/postscript3_2.xhtml" could not be found in the EPUB.
ERROR(RSC-007): ./AI 3.0.epub/OEBPS/chapter-28.xhtml(151,93): Referenced resource "Text/postscript3.xhtml" could not be found in the EPUB.

Check finished with errors
Messages: 0 fatals / 12 errors / 0 warnings / 0 infos

EPUBCheck completed
Error: Command failed: java -jar '/Users/lion/Library/pnpm/global/5/.pnpm/[email protected]/node_modules/epubcheck-assets/assets/epubcheck-5.1.0/epubcheck.jar' '/Users/lion/AI 3.0.epub'
at checkExecSyncError (node:child_process:890:11)
at execSync (node:child_process:962:15)
at M (file:///Users/lion/Library/pnpm/global/5/.pnpm/[email protected][email protected]/node_modules/weread-spy/dist/bin.js:2:529)
at ye (file:///Users/lion/Library/pnpm/global/5/.pnpm/[email protected][email protected]/node_modules/weread-spy/dist/bin.js:26:13522)
at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
at async ir (file:///Users/lion/Library/pnpm/global/5/.pnpm/[email protected][email protected]/node_modules/weread-spy/dist/bin.js:27:2050)

weread-spy:one -------------------- generate complete -------------------- +21s
weread-spy:one epub file: /Users/lion/AI 3.0.epub +1ms
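
The RSC-005 errors mean the same `id` value appears more than once inside a single chapter file. One way to work around that, sketched below, is to rename repeated ids after the chapter XHTML has been generated. `dedupeIds` is a hypothetical helper, not part of weread-spy. It only handles double-quoted attributes, does not check whether the generated name collides with an existing id, and does nothing about the RSC-007 errors for the missing `Text/postscript3.xhtml` resources.

```typescript
// Hypothetical post-processing step: make id attributes unique within one XHTML string.
function dedupeIds(xhtml: string): string {
  const seen = new Map<string, number>()
  // Require whitespace before `id=` so attributes such as data-id are left alone.
  return xhtml.replace(/(\s)id="([^"]+)"/g, (match: string, space: string, id: string) => {
    const count = seen.get(id) ?? 0
    seen.set(id, count + 1)
    // The first occurrence keeps its id so existing "#id" links still resolve.
    return count === 0 ? match : `${space}id="${id}-dup${count}"`
  })
}
```

Keeping the first occurrence unchanged is a deliberate choice: any in-book link that targets the id continues to land on the first element that carried it.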

Error while scraping a book

C:\Users\46266\AppData\Local\pnpm\global\5.pnpm\[email protected]\node_modules\puppeteer\lib\cjs\puppeteer\common\ExecutionContext.js:286
throw new Error('Evaluation failed: ' + (0, util_js_1.getExceptionMessage)(exceptionDetails));
^

Error: Evaluation failed: TypeError: Cannot read properties of undefined (reading '$store')
at pptr://puppeteer_evaluation_script:2:34
at ExecutionContext._ExecutionContext_evaluate (C:\Users\46266\AppData\Local\pnpm\global\5.pnpm\[email protected]\node_modules\puppeteer\lib\cjs\puppeteer\common\ExecutionContext.js:286:15)
at processTicksAndRejections (node:internal/process/task_queues:96:5)
at async ExecutionContext.evaluate (C:\Users\46266\AppData\Local\pnpm\global\5.pnpm\[email protected]\node_modules\puppeteer\lib\cjs\puppeteer\common\ExecutionContext.js:117:16)
at async ElementHandle.evaluate (C:\Users\46266\AppData\Local\pnpm\global\5.pnpm\[email protected]\node_modules\puppeteer\lib\cjs\puppeteer\common\JSHandle.js:105:16)
at async ElementHandle.$eval (C:\Users\46266\AppData\Local\pnpm\global\5.pnpm\[email protected]\node_modules\puppeteer\lib\cjs\puppeteer\common\ElementHandle.js:665:24)
at async waitCondition (C:\Users\46266\AppData\Local\pnpm\global\5.pnpm\[email protected]\node_modules\weread-spy\lib\commands\one.js:90:18)
at async decideDownload (C:\Users\46266\AppData\Local\pnpm\global\5.pnpm\[email protected]\node_modules\weread-spy\lib\commands\one.js:98:5)

Duplicate chapter CSS

image

The contents of these CSS files are actually identical. It would be good to optimize this and keep only one copy.
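
The request could be approached by keying stylesheets on their content, so that chapters with identical CSS share one file. The following is a minimal sketch under that assumption. The `ChapterCss` shape, the `style-N.css` naming and the `dedupeChapterCss` function are all hypothetical and do not reflect how weread-spy currently writes its files.

```typescript
interface ChapterCss {
  chapterUid: number
  css: string
}

interface DedupedCss {
  files: { filename: string; css: string }[] // one entry per distinct stylesheet
  byChapter: Map<number, string> // chapterUid -> filename to link from that chapter
}

// Hypothetical helper: emit each distinct stylesheet once and map chapters to it.
function dedupeChapterCss(chapters: readonly ChapterCss[]): DedupedCss {
  const filenameByContent = new Map<string, string>()
  const files: { filename: string; css: string }[] = []
  const byChapter = new Map<number, string>()
  for (const { chapterUid, css } of chapters) {
    let filename = filenameByContent.get(css)
    if (filename === undefined) {
      // First time this exact CSS text is seen: allocate a new shared file.
      filename = `style-${files.length + 1}.css`
      filenameByContent.set(css, filename)
      files.push({ filename, css })
    }
    byChapter.set(chapterUid, filename)
  }
  return { files, byChapter }
}
```

Comparison here is exact string equality, so stylesheets that differ only in whitespace would still be written separately.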

Download fails; hoping the maintainer can keep maintaining the project

The error output is as follows:
```
PS C:\Users\inssu> weread-spy one
ua = Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/116.0.0.0 Safari/537.36

The current URL looks like a book:
[url]: https://weread.qq.com/web/reader/e1932d70813ab82e7g014f5b
[title]: 海绵阅读法:如何吸收一本书的精华-李小墨-微信读书
? Download: (Y/n)

The current URL looks like a book:
[url]: https://weread.qq.com/web/reader/e1932d70813ab82e7g014f5b?
[title]: 海绵阅读法:如何吸收一本书的精华-李小墨-微信读书
? Download: Yes
file:///C:/Users/inssu/AppData/Local/Yarn/Data/global/node_modules/puppeteer-core/lib/esm/puppeteer/common/WaitTask.js:56
void this.terminate(new TimeoutError(`Waiting failed: ${options.timeout}ms exceeded`));
^

TimeoutError: Waiting failed: 30000ms exceeded
at Timeout.<anonymous> (file:///C:/Users/inssu/AppData/Local/Yarn/Data/global/node_modules/puppeteer-core/lib/esm/puppeteer/common/WaitTask.js:56:37)
at listOnTimeout (node:internal/timers:573:17)
at process.processTimers (node:internal/timers:514:7)
```

Switching the UA is more convenient

Using the Kindle UA makes it easier to generate the epub:
Mozilla/5.0 (X11; U; Linux armv7l like Android; en-us) AppleWebKit/531.2+ (KHTML, like Gecko) Version/5.0 Safari/533.2+ Kindle/3.0+

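With this UA, WeRead in the browser serves its Kindle page, where text and images can be copied and everything is present in the page source.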

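weread-spy command line

  • weread-spy dl -u <url> fails to start. Is the condition if (!justLaunch || !url) { wrong?
  • weread-spy gen -u <url> cannot generate a book unless an id is given: the id is not found. Is the map it reads empty?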
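
To make the first point concrete, the sketch below compares the quoted condition with one possible alternative. It assumes the branch guarded by the condition is the one that stops the command, which the report does not confirm. `DlOptions`, `branchTakenWithOr` and `branchTakenWithAnd` are hypothetical names, and the `&&` variant is only a guess at the intended logic.

```typescript
interface DlOptions {
  url?: string
  justLaunch?: boolean
}

// Condition as quoted in the report: true whenever either option is missing.
function branchTakenWithOr({ url, justLaunch }: DlOptions): boolean {
  return !justLaunch || !url
}

// One possible alternative (an assumption): true only when both options are missing.
function branchTakenWithAnd({ url, justLaunch }: DlOptions): boolean {
  return !justLaunch && !url
}
```

With only a URL supplied, as in `weread-spy dl -u <url>`, the `||` form still evaluates to true, which would match the reported behavior if that branch aborts.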

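Pages in the downloaded e-book are laid out incorrectly

This tool is really impressive!
However, I ran into the following problems while using it:
1. As mentioned in the post above, annotations cannot be scraped.

2. In the generated epub, the numbers in front of the page numbers in the table of contents overlap, as shown:
image

3. Code in the book overflows the page, as shown:
image

4. Many pages hold only a few lines of text, and the gaps between paragraphs are too large, as shown:
image

If you can find the time, could you see whether these can be fixed? [heart]
Much appreciated! [admire]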
