GithubHelp home page GithubHelp logo

gorbash1370 / humble-bundle-book-info Goto Github PK

View Code? Open in Web Editor NEW
48.0 48.0 1.0 3.44 MB

Automated retrieval of Humble Book Bundle information, outputted to txt file. Includes Amazon review ratings, ISBNs, ASINs, published date, retail prices and gathers useful URLs to explore the titles (i.e. Google Books previews & Amazon ebook page).

License: GNU General Public License v3.0

Python 100.00%

humble-bundle-book-info's Introduction

Welcome! ๐Ÿฒ ๐Ÿ‘‹ ๐Ÿฑโ€๐Ÿ‘ค

Page not displaying correctly? Here's the boring plain version

About gorbash1370

Addiction | Learning

Style | Foundational knowledge. Learn-By-Do

Self-taught | Humble Book Bundles, ChatGPT, Copilot

Tech Podcasts | Security Now, Unsupervised Learning, Darknet Diaries

Interests | CyberSec, Networking, (the power of) Code, Automation, AI (especially art & general manipulation!)

Direction | Career switch into CyberSecurity, Networking & Network Security

badge security badge ai badge data science

Languages & Tools

icon python icon mysql icon js

line

My Repos

Hobby-coding for only about 6 months at the start of 2024, my GitHub is a "Learning in Public" endeavour, hosting my practice projects. On the plus side, my code should be accessible for other learners and fairly simple to scrutinise!

If anyone happens to find my small scripts useful then

Hopefully over time my work will attest to an increase in sophistication!

Projects

whisper-wrapper | A simple but feature-rich implementation of OpenAI's Whisper transcription model. My first published coding project!

rename-packt-vids | Simple, niche script that renames Packt course video files (format video1_1) according to their individual Series/Module and Episode titles. Feeds in nicely to whisper-wrapper.

humble-bundle-book-info | Script does a little web-scraping via Google Books API & Bing Search API to automate researching individual titles in Humble Book Bundles, just using the HB webpage url. All the gathered information is output to nicely formatted text files. See /HumbleBundleBooks for sample outputs.

security-now-files | Automated fetch of transcripts & shownotes for the fantastic Security Now podcast. Includes options for downloading just the latest episode, a specific range of episodes or the whole back-catalogue (in server-respectful fashion, of course!) and the ability to select target file formats (pdf, txt or htm).

More Coming Soon!

If you do happen to have reeeeeally enjoyed my work then what I would most appreciate is a pat on the back, but, Contributions to the Caffine Fund โ˜• are always welcome too.

Super n00b Stats

gorbash1370's GitHub stats

Collab

buffer leftbuffer right

Having benefited greatly from the generous sharing by others of their skills and output, I'm really passionate about the Open Source philosophy and hope to contribute meaningfully to the community.

This profile is deliberately explicit about my experience and stage in the learning journey: if anyone has a project I could work on, wants to collab or is interested in mentoring, please do reach out!

Feedback and constructive criticism on my work is always welcome.

Contact
line

Credits & Thanks

Widgets & Resources Used In This Profile

github-readme-stats
MikeCodesDotNET/ColoredBadges
qkrdmstlr3/techstack-generator
Slackmojis

Profile Inspiration

abhisheknaiidu
trinib
supuna97

line

gif cat roomba why not

Who doesn't love a cat-on-a-roomba?!
Thanks for stopping by!

humble-bundle-book-info's People

Contributors

gorbash1370 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

werefrog

humble-bundle-book-info's Issues

Bug caused by missing authors in selected books presented by Make (Electronics & Design) bundle 24 05 01

Electronics and Design for Entrepreneurs, Tech Book Bundle by Make presented an issue for the code whereby only 12 book entries were being returned for a Bundle of 15 - no error messages produced by the code.

Upon further investigation, the sychronisation of authors to the title-blurbs was also being mismatched for some books.

Inspection of the Selenium retrieved html/js and the webpage direct, revealed that 3 books provided no Author: field or value (only Publisher: Make).

The original code assumed that all books will have Title - Author - Blurb.
The assumption wasn't unreasonable on the experience of c20 bundles processed without issue, but now presents a flaw.
The code parses the retrieved strings that represent the Title - Author - Blurb fields positionally into the dictionary that orchestrates the API calls, and creates the text files for output.

Therefore, when any field is missing from a single book entry it will mess up the correct matching between the three fields across the individual books in the bundle.

Missing fields therefore present a fatal flaw that contaminate the book information retrieval at the API call stage.

An immediate fix requires identifying and handling (probably via placeholder substitution) instances where an Author field is totally absent for a book. May later have to extend this fallback capability to Title (very unlikely) and Blurb fields too.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.