GithubHelp home page GithubHelp logo

Comments (21)

Flameish avatar Flameish commented on May 26, 2024

The cover and table of contents page is already implemented and will be in the next release which should be done tomorrow/next few days.
The invalid character naming is also fixed. (Thanks for pointing it out, I forgot about it!)

I'm not too sure about adding an "About" segment, at least having it done automatically. There are many sites which do not have clean and easy to scrap novel descriptions which would make this really messy.
On the other hand I don't quite see the point of having a credit page, with a link to the novel site on top. At least not for my own personal copy.
I try to set the author and translator both as the epub "author" through.

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

Okay.. Having only TOC is also good. Also, You can look more into re structuring the ebook along with additional options Option that would help the user customize the EPUBS like HR tag should be included or not and ETC.

Anyways, Thanks for such a great piece of software!

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

I hope I understood you correctly with the HR tag thing you mentioned. Novel tags like "Action, Adventure, Sci-Fi" etc. from Wuxiaworld/RoyalRoad (and other sites which list them) are now added to EPUBs.

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

IN the file called Toc.ncx in the EPUB, If possible dd the location of coverPages.html too.
And by HR i meant the hr tag on the last of each chapter that is being placed.

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

I can add the coverPage to the toc.ncx, that's no problem. But I probably will remove the Table of Contents page from the .ncx since its right after the cover. It's a bit cleaner that way.

You want the option to have a


at the end of each chapter? Sure I can add that as well.

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

Its already there. I want the option to remove it. That's all.

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

Now I'm confused. There is no hr tag/line at the end of the chapters, at least none which I added. Can you give me some info on which novel this is happening or is it everywhere?

Edit: I checked Tales of Demons & Gods and the hr tag comes from the chapter itself. You can add the hr tag to the blacklisted tags and download it manually.
Here is an image of my settings

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

Oh. Thanks for the help!
I though it was built in function but it doesn't seems so.

In future, do you have any plans to add Webnovel.com support natively?

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

No. The ToC page is loaded dynamically after a user clicks on the table of contents tab, its "invisible" to my HTML parser. It might be possible to get them manually with chapter-to-chapter but alot of the chapters are behind a paywall anyway.

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

Yeah. It's same for me. But you can check there are many scripts on Github that are able to Download from Webnovel.com. Try to look into them and see if you can implement them!

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

Also, is it possible to add a native support for MTL websites? Babelnovel.com can be implemented easily but other like comradeo/lnmtl and others have Chinese Words in the same page too(When using Reader Mode).

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

I don't know how I feel about bypassing a paywall and If I want to implement that. So webnovel.com is a no for now. Babelnovel is not a static website aswell (or as far as I saw). Maybe you can elaborate how I might be able to add the site easily without having to implement a full support for dynamic websites? I might take a look at that in the future through.

Im not familiar with the term "MTL". Are you talking about Modern Taiwanese Language? I think it would be best if you open a new issue labeled "supported website request" or something like that where you specify which websites you have in mind and I'll see what I can do. Generally, I only add native support for websites which have more than 10 active novels on them, manual grabbing was intended for everything else.

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

I'm not asking to Scrape premium novels. Free novels do exists on Webnovel which can be scraped.
Also MTL refers to Machine Translated Novels. Eg. https://lnmtl.com/ & https://comrademao.com/.
IDK what are dynamic websits, but if you see that babelnovel has an easy URL of chapters. Like babelnovel.com/novel-name/chapter-xxx/ So if users can define Chapters AND INFORMATION CAN BE SCRAPED FROM THE BABELNOVEL'S WEBSITE!

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

I did notice the URL of babel and thought of implementing a third manual grabbing method which works directly with URLs. However, it is still not possible at the moment without big adjustments (Like I said, maybe in the future), because each chapter on Babel is loaded dynamically as well, meaning, the chapter content is fetched, after the page loaded, with a script. Static websites, like for e.g Wuxiaworld, have their chapters fully "inserted" in HTML on the very first page load and I'm able to easily work with that. Until I implement a miniature browser/AJAX scrapper or whatever I really need, dynamic sites wont be possible.

lnmtl.com and comrademao.com will not work "automatically", both will have to use the manual "chapter-to-chapter" method. I'll think about implementing a hybrid of automatic and manual grabbing for sites likes these where you have to input the novel URL, first/starting chapter URL and last/stopping chapter URL (because I don't know how many chapters there are) on the automatic tab.

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

Okay! Thanks a lot!

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

You can download from Webnovel.com now (non vip chapters)

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

The Cover images that is downloaded from Webnovel is of very low quality.
Suppose if the book is: https://www.webnovel.com/book/8094015805004305/Tales-of-Demons-and-Gods then the cover image that is downloaded is: https://img.webnovel.com/bookcover/8094015805004305/150/150.jpg but the cover like https://img.webnovel.com/bookcover/8094015805004305/600/600.jpg and https://img.webnovel.com/bookcover/8094015805004305/300/300.jpg exists.

Also, the close Button (X) doesnt work in Edit Blacklisted Tags(The Hamburger Menu). Only clicking Ok does the dialog Box closes.

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

Some more stuff: There are actually 2 Same Photos(The Book Cover) with different name which are cover.jpg and 150.jpg(the name of the cover image downloaded as it is) in the EPUB.

The Chapter name is also not there at the start of Each Chapter (Possibly Webnovel Specific) and there is no Numbering of Chapters in TOC too.

Also, the " ' " without space is shown as รขโ‚ฌโ„ขs in the EPUB. Like :

image

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

Thanks for your report! I'm sorry for the late response. Check out the latest version, everything should work correctly now. (Except the encoding issue, I'm still working on it and just put a crude hotfix in place for it)

from novel-grabber.

RuthlessRuler avatar RuthlessRuler commented on May 26, 2024

Thanks for the update. The issue chapter title missing at start of each chapter is still there in WebNovel Downloads. Also,the cover can be downloaded of higher size (600.jpg) from webnovel

from novel-grabber.

Flameish avatar Flameish commented on May 26, 2024

I've updated the 2.1.4 jar.

from novel-grabber.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.