Comments (7)
Thank you as always!
While I have your attention, have you tried the OCR feature?
I have not. School's been a bit hectic for me lately and this is the first time I've used Memento in a while. You can tell because otherwise surely I'd encounter this bug earlier. But I'll try it out soon and think about it.
from memento.
The fix will be up on master shortly. As always, thanks for the report.
While I have your attention, have you tried the OCR feature? If you have, what are you thoughts? I really want to figure out what to do with it in the next release.
from memento.
Don't worry about it. No rush.
from memento.
I saw that
Any problems you may have getting manga-ocr installed using pip is beyond the scope of this project. I wish you the best of luck.
But I am still wondering if I can have some pointers. I am getting the ModuleNotFoundError: No module named 'manga_ocr'
error as mentioned in https://github.com/ripose-jp/libmocr README, but I know nothing about the build system to specify the Python version for building libmoc
as a dependency of Memento.
On this specific system I should only have one installation of Python3.10 via homebrew, as Apple removed the default Python in macOS 12.3, so still getting this error was surprising to me.
from memento.
That's strange. I didn't do anything special to install it on macOS. I just installed manga-ocr
via pip and it just worked. You could try forcing CMake to link to Python 3.10 by passing -DEXACT_PYTHON_VERSION=3.10
in the CMAKE_ARGS
environment variable.
The problem is that the version of Python that Memento is linked to isn't the same version as the one with manga-ocr
installed. There are a ton of reasons as to possibly why:
- Multiple installed Python versions and Memento is linked to the wrong one
- Some weird configurations with virtual environments
- Something with envvars like
PYTHON_HOME
being set wrong (not likely)
I couldn't tell you which.
from memento.
Thank you. I've been able to build Memento with OCR support. So far it the functionality works amazing. Currently though Memento does not seem to support associating an OCR (or manual lookup) result with a certain stretch of audio in the absence of a subtitle file, which feels like neglecting Memento's most important feature.
I think it would be great if Memento allows the user to choose a part of the current video as the current context. Some DWIM behavior I imagine is:
- allow the user to continually mark points in the video with a hot key
- when adding a card:
- if there are no points past current playback time: extract context between the last two set points
- if current playback time is between two points: extract context between the two points
- when adding a card:
Also this might not be possible, but I wonder if Memento could use the timing information from sub-seek
to determine the current {sentence}
. This feels inflexible and might be expensive, so I don't really know.
from memento.
Glad that you got it working. Since you're making a feature request, could you open a new issue so I can track it? You can copy and paste it.
My only major question regarding the feature is what is the use case? It makes sense to me that manual search may be associated with an audio clip if raster subtitles were OCR'd. Outside of that, OCR will usually be associated with visual context as opposed to audio context.
The reason I wanted to add OCR in the first place was due to Evangelion episode 14 using cards of text throughout to communicate information. The second use case I found after implementing it was using this script to turn Memento into a manga reader. For both of these cases, I don't see the benefit of extracting audio from the content.
from memento.
Related Issues (20)
- MacOS - Please ensure your Linux system has 64 bit binary support HOT 3
- Script doesn't work when launching memento but works from command line HOT 6
- Cannot create card from sentences with <> between them HOT 2
- Size and quality options for Anki screenshots. HOT 8
- Bad Video Playback Performance (on Mac) HOT 7
- Fails to populate {audio-media} HOT 8
- Memento cannot tell if a word is in your deck if added externally HOT 4
- Changing font size for secondary subtitle track HOT 1
- Keep session and autoplay last played video HOT 6
- Other languages HOT 2
- Unpause video when cursor is away from subtitles HOT 4
- Duplicate card logic has false positives with pure hiragana words
- Modifier keys don't behave correctly on macOS HOT 3
- Sometimes fails to populate {audio-media}
- Auto text edit field focus HOT 4
- Looking up Bug HOT 1
- [ytdl_hook] ERROR: Unable to extract uploader id; please report this issue on https://yt-dl.org/bug HOT 8
- Inconsistent ability to add Anki notes HOT 8
- there is build error when compiling , please help. HOT 2
- Search Within Subtitle List HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from memento.