igorcmoura / anitopy
Anime video filename parser in Python, based on Anitomy
License: Mozilla Public License 2.0
The library fails to recognize some anime when the version suffix comes right after the episode number:
[Judas] Aharen-san wa Hakarenai - S01E06v2.mkv
I was wondering if you have any plans to support season information in titles, e.g.:
[HorribleSubs] Mob Psycho 100 S2 - 07 [1080p].mkv
Thank you for this neat and handy tool, it really saved me from a lot of headaches.
When there's a filename in the following format:
ANIME_TITLE Season X ANIME_TYPE [round, square or curly brackets] EXACT_SAME_ANIME_TYPE Episode Y
anitopy fails to parse it and throws an exception.
It only triggers when the same anime_type is repeated once or more and there's something (or nothing) in brackets between them. The error doesn't happen when I omit the season or episode number.
>>> anitopy.parse("One-Punch Man Season 1 OVA [Judas] OVA - 05.mkv")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib/python3.9/site-packages/anitopy/anitopy.py", line 48, in parse
    if not parser.parse():
  File "/usr/lib/python3.9/site-packages/anitopy/parser.py", line 33, in parse
    self.validate_elements()
  File "/usr/lib/python3.9/site-packages/anitopy/parser.py", line 302, in validate_elements
    Elements.erase(ElementCategory.EPISODE_TITLE)
  File "/usr/lib/python3.9/site-packages/anitopy/element.py", line 107, in erase
    del cls.instance()._elements[category.value]
KeyError: 'episode_title'
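The last frame of the traceback shows `del` being applied to a key that is not in the dictionary. As a minimal sketch of how an erase operation can tolerate a missing key, the snippet below uses a hypothetical `ElementStore` class; it is not anitopy's actual `Elements` implementation, and the right fix in the library may be different.

```python
class ElementStore:
    """Hypothetical category -> value store, for illustration only."""

    def __init__(self):
        self._elements = {}

    def insert(self, category, value):
        self._elements[category] = value

    def erase(self, category):
        # dict.pop with a default is a no-op when the key is absent,
        # whereas `del self._elements[category]` raises KeyError.
        self._elements.pop(category, None)

    def get_all(self):
        return dict(self._elements)


store = ElementStore()
store.insert("anime_title", "One-Punch Man")
store.erase("episode_title")  # never inserted, so nothing happens
store.erase("anime_title")    # present, so it is removed
```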
One use case is making file names uniform by detecting the delimiter used and replacing it with a space.
The output could have a field called "delimiter_detected" for this, if the delimiter can be determined.
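One possible way to determine the delimiter is to count candidate characters in the name and pick the most frequent one. The sketch below does that outside of anitopy; the function name `detect_delimiter` and the candidate set (space, dot, underscore) are assumptions made for illustration, not part of the library.

```python
import os
from collections import Counter

# Assumption: only these characters are considered as delimiters.
CANDIDATE_DELIMITERS = " ._"


def detect_delimiter(file_name):
    """Return the most frequent candidate delimiter, or None if there is none."""
    # Drop the extension so its leading dot is not counted.
    stem, _ext = os.path.splitext(file_name)
    counts = Counter(ch for ch in stem if ch in CANDIDATE_DELIMITERS)
    if not counts:
        return None
    return counts.most_common(1)[0][0]


dotted = detect_delimiter("JoJos.Bizarre.Adventure.2012.S02E47.mkv")
spaced = detect_delimiter("[HorribleSubs] Mob Psycho 100 S2 - 07 [1080p].mkv")
```

Blindly replacing the detected delimiter with a space would also split terms such as `H.264`, so a real implementation would need to protect known keywords first.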
The release group isn't properly parsed from some files that use scene naming.
Example:
{'file_name': 'JoJos.Bizarre.Adventure.2012.S02E47.Dios.World.3.1080p.BluRay.Remux.DTS-HD.MA2.1.H.264-Meakes.mkv', 'file_extension': 'mkv', 'video_term': 'H.264', 'video_resolution': '1080p', 'source': 'BluRay', 'release_information': 'Remux', 'anime_season': '02', 'episode_number': '47', 'anime_title': 'JoJos Bizarre Adventure 2012', 'episode_title': 'Dios World.3'}
This naming style has most of the information delimited by ., but the release group is delimited at the very end by a dash. Ideally, anitopy would correctly return 'release_group': 'Meakes'.
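Until the parser handles this, the trailing group could be picked up in a separate step. The sketch below is a heuristic under stated assumptions: `scene_release_group` is a hypothetical helper, it treats a name without spaces as scene-style, and it assumes the group name is purely alphanumeric.

```python
import os
import re

# Assumption: the group is an alphanumeric run after the last dash of the stem.
TRAILING_GROUP = re.compile(r"-(?P<group>[A-Za-z0-9]+)$")


def scene_release_group(file_name):
    """Return the trailing '-Group' of a scene-style name, or None."""
    stem, _ext = os.path.splitext(file_name)
    # Heuristic: scene names use dots instead of spaces; skip anything else
    # so titles such as "Aharen-san" are not mistaken for a group.
    if " " in stem:
        return None
    match = TRAILING_GROUP.search(stem)
    return match.group("group") if match else None


group = scene_release_group(
    "JoJos.Bizarre.Adventure.2012.S02E47.Dios.World.3.1080p.BluRay."
    "Remux.DTS-HD.MA2.1.H.264-Meakes.mkv"
)
```

The result could be merged into the dictionary returned by `anitopy.parse` when it has no `release_group` key.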
Currently, anitopy only supports Python 3.5 and above.
>>> anitopy.parse('[GM-Team][国漫][诛仙][Jade Dynasty][2022][11][HEVC][GB][1080p]')
{'file_name': '[GM-Team][国漫][诛仙][Jade Dynasty][2022][11][HEVC][GB][1080p]', 'video_resolution': '1080p', 'video_term': 'HEVC', 'anime_year': '2022', 'episode_number': '11', 'anime_title': 'Jade Dynasty', 'release_group': 'GM-Team'}
>>> anitopy.parse('[GM-Team][国漫][诛仙][Jade Dynasty][2022][11][HEVC][GB][4K]')
{'file_name': '[GM-Team][国漫][诛仙][Jade Dynasty][2022][11][HEVC][GB][4K]', 'video_term': 'HEVC', 'anime_year': '2022', 'episode_number': '11', 'anime_title': 'Jade Dynasty', 'release_group': 'GM-Team'}
The video_resolution property is missing from the second result, where the resolution is written as 4K instead of 1080p.
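As a rough illustration of what recognizing `4K` could look like, the sketch below scans bracketed tokens with a regular expression that accepts both forms. `find_resolution` and the pattern are assumptions for this example and do not reflect how anitopy matches resolutions internally.

```python
import re

# Text inside each pair of square brackets.
BRACKETED_TOKEN = re.compile(r"\[([^\]]+)\]")
# Assumption: accept forms like 1080p, 1920x1080, 4K and 8K.
RESOLUTION = re.compile(r"^(?:\d{3,4}[pi]|\d{3,4}x\d{3,4}|[48]K)$", re.IGNORECASE)


def find_resolution(file_name):
    """Return the first bracketed token that looks like a resolution, or None."""
    for token in BRACKETED_TOKEN.findall(file_name):
        if RESOLUTION.match(token):
            return token
    return None


uhd = find_resolution("[GM-Team][国漫][诛仙][Jade Dynasty][2022][11][HEVC][GB][4K]")
fhd = find_resolution("[GM-Team][国漫][诛仙][Jade Dynasty][2022][11][HEVC][GB][1080p]")
```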
Please extend the recognition algorithm to cover this versioning scheme, since a lot of fansub groups use it.
Does not work:
- [Judas] Chuunibyou S1 - NCEDv2.mkv # season 1 ending
- [Judas] Chuunibyou S1 - S02.mkv # season 1 special eps 2
- [Judas] Clannad - S01OVA01.mkv # season 1 ova eps 1
Works:
- [Judas] Chuunibyou S1 - NCED v2.mkv
- [Judas] Chuunibyou S1 - SP02.mkv
- [Judas] Clannad - S01 OVA01.mkv
If you decide to add this, I can search for more files that use this scheme.
For now, I just apply the check manually on my side before the name is parsed by anitopy.
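A manual check of this kind could be a small rewrite step that turns the failing forms listed above into the working ones before the name reaches the parser. The sketch below is only a guess at such a step: `normalize` and its patterns are hypothetical, and the `S02` to `SP02` rule in particular assumes this release group's convention, since `S02` can mean season 2 elsewhere.

```python
import re

REWRITES = [
    # "NCEDv2" -> "NCED v2": separate the version from the creditless tag.
    (re.compile(r"\b(NCED|NCOP)(v\d+)\b"), r"\1 \2"),
    # "S01OVA01" -> "S01 OVA01": separate the season from the OVA number.
    (re.compile(r"\b(S\d{2})(OVA\d+)\b"), r"\1 \2"),
    # " - S02.mkv" -> " - SP02.mkv": assumption that a lone S## right before
    # the extension means "special" in this naming scheme.
    (re.compile(r"(?<= - )S(\d{2})(?=\.[^.]+$)"), r"SP\1"),
]


def normalize(file_name):
    """Apply each rewrite in order and return the adjusted name."""
    for pattern, replacement in REWRITES:
        file_name = pattern.sub(replacement, file_name)
    return file_name


fixed = [
    normalize("[Judas] Chuunibyou S1 - NCEDv2.mkv"),
    normalize("[Judas] Chuunibyou S1 - S02.mkv"),
    normalize("[Judas] Clannad - S01OVA01.mkv"),
]
```

The normalized name would then be passed to `anitopy.parse` in place of the original.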
Currently the Elements and Tokens classes are singletons. The problem is that we have to be very careful to clean everything between each parse, or else state from a previous parse may remain in future parses.
It would be much better if a new instance was created at every parse to ensure total isolation.
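The idea can be sketched with simplified stand-ins: each parser owns its own element container, so nothing needs to be cleaned between calls. The `Elements`, `Parser` and `parse` names below mirror the ones mentioned here, but the bodies are hypothetical and much smaller than anitopy's real code.

```python
class Elements:
    """Simplified, non-singleton element container (illustrative only)."""

    def __init__(self):
        self._elements = {}

    def insert(self, category, value):
        self._elements[category] = value

    def get_all(self):
        return dict(self._elements)


class Parser:
    def __init__(self, file_name):
        self.file_name = file_name
        # A fresh container per parser: no state shared with earlier parses.
        self.elements = Elements()

    def parse(self):
        self.elements.insert("file_name", self.file_name)
        _name, dot, extension = self.file_name.rpartition(".")
        if dot:
            self.elements.insert("file_extension", extension)
        return self.elements.get_all()


def parse(file_name):
    return Parser(file_name).parse()


first = parse("Clannad - 01.mkv")
second = parse("Clannad - 02")  # no extension; nothing leaks from `first`
```

With a shared singleton that was not cleared, `second` would still carry the `file_extension` left over from `first`.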