
Comments (5)

JockeTF commented on July 28, 2024

All of the metadata (except for archive) comes directly from Fimfiction. So when something goes missing, it's essentially frozen in time. That's not really by design; I just haven't done all that much when it comes to cleaning things up.

There is a way for you to filter out stories that are no longer available though! You can do this by looking into the (somewhat poorly named) timestamps in archive. Note that some of these are null, since I haven't retroactively added things such as creation dates.

  • date_checked: When did we last try to update the story?
  • date_created: When was the story added to the archive?
  • date_fetched: When was the last update of the metadata?
  • date_updated: When was the last update of the content?

The other timestamps will exactly match date_checked if a change happened for that version of the archive. So, to check if a story was still publicly available on Fimfiction at the last check, you could compare date_checked to date_fetched.

>>> from fimfarchive.fetchers import FimfarchiveFetcher
>>> 
>>> def was_available(story):
...     archive = story.meta['archive']
...     date_checked = archive['date_checked']
...     date_fetched = archive['date_fetched']
... 
...     return date_checked == date_fetched
... 
>>> 
>>> fetcher = FimfarchiveFetcher('fimfarchive.zip')
>>> sum(1 for story in fetcher if was_available(story))
131322
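
Since some of the timestamps can be null, a slightly more defensive variant might be useful. The following is only a rough sketch, not part of fimfarchive: it assumes the same meta layout as the snippet above, takes the plain meta dictionary instead of a story object so that it runs on its own, and the name availability is made up for illustration. It returns None when the comparison cannot be made.

```python
def availability(meta):
    """Compare date_checked to date_fetched, tolerating missing values.

    Returns True or False when both timestamps are present, and None
    when either one is missing or null (assumed to mean "unknown").
    """
    archive = meta.get('archive') or {}
    date_checked = archive.get('date_checked')
    date_fetched = archive.get('date_fetched')

    # Without both timestamps there is nothing to compare.
    if date_checked is None or date_fetched is None:
        return None

    return date_checked == date_fetched
```

With a fetcher as above, this could be called as availability(story.meta), and stories that yield None could be counted separately from those that yield False.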

Hope that helps!

from fimfarchive.

JockeTF commented on July 28, 2024

Also, I'll be looking into story 31718.

It seems you might have found a rather significant bug, so thank you!


htnyquist commented on July 28, 2024

Thanks, date_fetched and date_checked sound like exactly what I need!
I am curious about the case of 31718, but ultimately, if it's limited to the series tag, that's something I can deal with by just assuming any fic without one is automatically 'MLP-FiM' (I assume that's what Fimfic did retroactively).


JockeTF commented on July 28, 2024

@htnyquist

I kept track of 31718 during the last archive update, and it seems to have updated fine without me doing anything. I believe what happened was that the story had been unpublished for a while. It probably got published again some time after the previous release.


JockeTF commented on July 28, 2024

There seem to be comments hinting at that as well!
