Comments (21)
The cover and table of contents pages are already implemented and will be in the next release, which should be out tomorrow or within the next few days.
The invalid character naming is also fixed. (Thanks for pointing it out, I forgot about it!)
I'm not too sure about adding an "About" segment, at least not automatically. Many sites don't have clean, easy-to-scrape novel descriptions, which would make this really messy.
On the other hand, I don't quite see the point of having a credit page with a link to the novel site on top. At least not for my own personal copy.
I try to set both the author and the translator as the EPUB "author", though.
from novel-grabber.
Okay. Having only the TOC is also good. You could also look into restructuring the ebook and adding options that let the user customize the EPUBs, like whether the hr tag should be included or not, etc.
Anyways, Thanks for such a great piece of software!
I hope I understood you correctly with the HR tag thing you mentioned. Novel tags like "Action, Adventure, Sci-Fi" etc. from Wuxiaworld/RoyalRoad (and other sites which list them) are now added to EPUBs.
In the file called toc.ncx in the EPUB, if possible, add the location of coverPages.html too.
And by HR I meant the hr tag that is being placed at the end of each chapter.
I can add the coverPage to the toc.ncx, that's no problem. But I will probably remove the Table of Contents page from the .ncx since it's right after the cover. It's a bit cleaner that way.
You want the option to have an hr tag at the end of each chapter? Sure, I can add that as well.
It's already there. I want the option to remove it. That's all.
Now I'm confused. There is no hr tag/line at the end of the chapters, at least none that I added. Can you tell me which novel this is happening on, or is it everywhere?
Edit: I checked Tales of Demons & Gods and the hr tag comes from the chapter itself. You can add the hr tag to the blacklisted tags and download it manually.
Here is an image of my settings
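The blacklist idea above can be illustrated with a minimal sketch. It is not novel-grabber's actual code: the class name `TagBlacklist` and the method `strip` are hypothetical, and a regex is used only to keep the example free of dependencies (a real HTML parser would be more robust). It removes only the tag itself, which is enough for a void element like hr.

```java
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of novel-grabber: strips blacklisted tags
// such as <hr> from a chapter's HTML before it is written to the EPUB.
final class TagBlacklist {

    // Removes the opening/closing tag markup for each blacklisted tag name.
    // Element content is left untouched, so this suits void elements like hr.
    static String strip(String html, List<String> blacklistedTags) {
        String result = html;
        for (String tag : blacklistedTags) {
            // Matches <hr>, <hr/>, <hr class="x"> and </hr>, case-insensitively,
            // but not longer names such as <hrx>.
            Pattern p = Pattern.compile(
                    "</?" + Pattern.quote(tag) + "(\\s[^>]*)?/?>",
                    Pattern.CASE_INSENSITIVE);
            result = p.matcher(result).replaceAll("");
        }
        return result;
    }

    public static void main(String[] args) {
        String chapter = "<p>End of chapter</p><hr/><p>Next</p>";
        System.out.println(strip(chapter, List.of("hr")));
    }
}
```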
Oh. Thanks for the help!
I thought it was a built-in function, but it doesn't seem so.
In the future, do you have any plans to add Webnovel.com support natively?
No. The ToC page is loaded dynamically after a user clicks on the table of contents tab, so it's "invisible" to my HTML parser. It might be possible to get the chapters manually with chapter-to-chapter, but a lot of the chapters are behind a paywall anyway.
Yeah, it's the same for me. But there are many scripts on GitHub that are able to download from Webnovel.com. Try looking into them and see if you can implement something similar!
Also, is it possible to add native support for MTL websites? Babelnovel.com can be implemented easily, but others like comrademao and lnmtl have Chinese words on the same page too (when using Reader Mode).
I don't know how I feel about bypassing a paywall, or whether I want to implement that, so webnovel.com is a no for now. Babelnovel is not a static website either (as far as I saw). Maybe you can elaborate on how I might add the site easily without having to implement full support for dynamic websites? I might take a look at that in the future, though.
I'm not familiar with the term "MTL". Are you talking about Modern Taiwanese Language? I think it would be best if you opened a new issue labeled "supported website request" or something like that, where you specify which websites you have in mind, and I'll see what I can do. Generally, I only add native support for websites which have more than 10 active novels on them; manual grabbing was intended for everything else.
I'm not asking you to scrape premium novels. Free novels do exist on Webnovel, and those can be scraped.
Also, MTL refers to machine-translated novels, e.g. https://lnmtl.com/ and https://comrademao.com/.
IDK what dynamic websites are, but Babelnovel has an easy URL scheme for chapters, like babelnovel.com/novel-name/chapter-xxx/. So if users can define the chapters, the information can be scraped from Babelnovel's website!
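As a rough sketch of the URL scheme described above, chapter URLs could be generated from a novel URL and a chapter range. Everything here is an assumption for illustration: the class `ChapterUrls`, the method `build`, and the idea that "xxx" is a plain chapter number without zero padding. Generating the URLs does not by itself make the chapter text available if the site loads its content with scripts.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: expands "<novel-url>/chapter-<n>/" for a chapter range.
final class ChapterUrls {

    // Builds one URL per chapter from first to last (both inclusive).
    // Assumes chapter numbers appear unpadded in the URL.
    static List<String> build(String novelUrl, int first, int last) {
        if (first > last) {
            throw new IllegalArgumentException("first must not exceed last");
        }
        // Normalize so that exactly one slash separates novel and chapter parts.
        String base = novelUrl.endsWith("/") ? novelUrl : novelUrl + "/";
        List<String> urls = new ArrayList<>();
        for (int i = first; i <= last; i++) {
            urls.add(base + "chapter-" + i + "/");
        }
        return urls;
    }

    public static void main(String[] args) {
        // "novel-name" is the placeholder used in the comment above.
        for (String url : build("https://babelnovel.com/novel-name", 1, 3)) {
            System.out.println(url);
        }
    }
}
```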
I did notice Babel's URL scheme and thought of implementing a third manual grabbing method which works directly with URLs. However, that is still not possible at the moment without big adjustments (like I said, maybe in the future), because each chapter on Babel is loaded dynamically as well, meaning the chapter content is fetched by a script after the page has loaded. Static websites, e.g. Wuxiaworld, have their chapters fully "inserted" in the HTML on the very first page load, and I'm able to work with that easily. Until I implement a miniature browser/AJAX scraper or whatever I really need, dynamic sites won't be possible.
lnmtl.com and comrademao.com will not work "automatically"; both will have to use the manual "chapter-to-chapter" method. I'll think about implementing a hybrid of automatic and manual grabbing for sites like these, where you have to input the novel URL, the first/starting chapter URL and the last/stopping chapter URL (because I don't know how many chapters there are) on the automatic tab.
Okay! Thanks a lot!
You can download from Webnovel.com now (non-VIP chapters).
The cover images downloaded from Webnovel are of very low quality.
Suppose the book is https://www.webnovel.com/book/8094015805004305/Tales-of-Demons-and-Gods. Then the cover image that is downloaded is https://img.webnovel.com/bookcover/8094015805004305/150/150.jpg, but covers like https://img.webnovel.com/bookcover/8094015805004305/600/600.jpg and https://img.webnovel.com/bookcover/8094015805004305/300/300.jpg exist.
Also, the close button (X) doesn't work in Edit Blacklisted Tags (the hamburger menu). The dialog box only closes when you click OK.
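The cover URLs quoted above suggest a pattern of book id followed by the size twice. A minimal sketch of building the larger cover URL from a book URL follows. The class `WebnovelCover` and the method `coverUrl` are hypothetical names, and the pattern is inferred only from the three example URLs in this comment, so it may not hold for every book or size.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: derives a cover image URL from a Webnovel book URL,
// assuming the pattern .../bookcover/<bookId>/<size>/<size>.jpg seen above.
final class WebnovelCover {

    // The numeric id that follows "/book/" in the book URL.
    private static final Pattern BOOK_ID = Pattern.compile("/book/(\\d+)");

    static String coverUrl(String bookUrl, int size) {
        Matcher m = BOOK_ID.matcher(bookUrl);
        if (!m.find()) {
            throw new IllegalArgumentException("No book id in URL: " + bookUrl);
        }
        return "https://img.webnovel.com/bookcover/" + m.group(1)
                + "/" + size + "/" + size + ".jpg";
    }

    public static void main(String[] args) {
        String book = "https://www.webnovel.com/book/8094015805004305/Tales-of-Demons-and-Gods";
        System.out.println(coverUrl(book, 600));
    }
}
```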
Some more stuff: there are actually two copies of the same photo (the book cover) in the EPUB under different names, cover.jpg and 150.jpg (the cover image's name as downloaded).
The chapter name is also missing at the start of each chapter (possibly Webnovel-specific), and there is no chapter numbering in the TOC either.
Also, the apostrophe (') without a space is shown as â€™s in the EPUB. Like:
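Garbling of this kind typically appears when UTF-8 bytes are decoded with a single-byte charset. The sketch below reproduces the effect for a right single quotation mark (U+2019) decoded as Windows-1252 and shows the reverse repair. That this is the cause of the bug reported here is an assumption; the class `MojibakeDemo` and its methods are hypothetical.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo: shows how a UTF-8 apostrophe turns into three
// unrelated characters when the bytes are read as Windows-1252.
final class MojibakeDemo {

    private static final Charset CP1252 = Charset.forName("windows-1252");

    // Encodes as UTF-8, then (wrongly) decodes as Windows-1252.
    static String misdecode(String text) {
        return new String(text.getBytes(StandardCharsets.UTF_8), CP1252);
    }

    // Reverses the mistake: re-encodes as Windows-1252, decodes as UTF-8.
    static String repair(String garbled) {
        return new String(garbled.getBytes(CP1252), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String original = "it\u2019s";          // "it's" with a curly apostrophe
        String garbled = misdecode(original);   // apostrophe becomes 3 characters
        System.out.println(garbled.length());   // 6 instead of 4
        System.out.println(repair(garbled).equals(original));
    }
}
```

The more reliable fix is to decode the downloaded page with the charset it declares, rather than repairing text afterwards.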
Thanks for your report! I'm sorry for the late response. Check out the latest version, everything should work correctly now. (Except the encoding issue, I'm still working on it and just put a crude hotfix in place for it)
Thanks for the update. The issue of the chapter title missing at the start of each chapter is still there in Webnovel downloads. Also, the cover can be downloaded at a higher size (600.jpg) from Webnovel.
I've updated the 2.1.4 jar.