ianzhao05 / textshot Goto Github PK

View Code? Open in Web Editor NEW

1.7K 1.7K 259.0 63 KB

Python tool for grabbing text via screenshot

License: MIT License

AutoHotkey 3.32% Python 96.68%

ocr ocr-recognition python python-3 python-script python3 screenshot script tesseract tesseract-ocr

textshot's People

Contributors

Stargazers

Watchers

Forkers

rhurta krithikvaidya haoict leonardofreua sayedsahbeni behyeefatt techiewasp shshahab hanzcoder cosito-bonito blongcha mewiecat halfk1ng schwenkd alanbosco 0xflotus kaimunchi hadryan cy-dev-tex brunorochax tianlajiangzhaji smart-patrol trendingtechnology krzemienski ioplock-zz aidev42 dataorz thatpolishboy13 jorgeavilacartes hybridego nbswords adwinwhite xrosliang hidannyxu rainly cv-ip zijin2 7more0 leuojn leedaga lfs119 huangshizhi 76782875 williamrjw yuhonghong95721 d-danielyang cqray1990 eujenz gaosq0604 spencertruett shivamnamdeo0101 jackhappy lthomiso binalmehta hasantahir aryansharmaa fakegit kingctan thehornydaddy ramblingnetworklife jkruigu chagge wming404 makarbaderko myccfoo xinxi-blip ashu-cybertron xingcxb tangli-1987 t0ny1974 ashyglim kousun henuguyu sunqiang25 mayurmorin nishanthkadapakonda lunker2019 rkrishna116 caswml ycj0808 osamafrougi ashimroy88 mukeshkumar2617 fengtaijun rajivnr yanjing2407 ashkin2 shanhedian2017 stuti24m bid-tools cimszw komal7209 lijiasheng1984 rus0wes simstems 1164513233 zxstar7789 youtang1993 zhongqianli alading241

textshot's Issues

Doesn't support MacBook Fullscreen

This tool could only take a shot on its current desktop. However, MacBook has a multi-desktop feature, and you can't ask this tool to take a shot on the desktop where the terminal is opened. Hope the author can support multi-desktop screenshots.

Screen turns black when opening textshot

When I run textshot opposite this problem. I show in video. Sorry bad english if I mistake anywhere.

simplescreenrecorder-2022-08-31_01.18.00.mp4

macOS Big Sur opens new screen

I just downloaded text shot on my Mac and installed all the dependencies but have been experiencing this weird behavior where as soon as I run it it will open a new screen to the right with no open apps and would only allow me to screenshot there. Did anyone else encounter this or have a fix?

2021-02-27 19:08:54.739 Python[952:13732] ApplePersistenceIgnoreState: Existing state will not be touched. New state will be written to /var/folders/ld/wjmpqdpj1pq2j4j_svh1j8740000gn/T/org.python.python.savedState

Segmentation fault, (core dumped)

I tried to run textshot on my fedora Linux machine and I got a segmentation fault error:
Traceback (most recent call last): File "/home/maerqin/PycharmProjects/Screenshot_To_Text/venv/lib/python3.12/site-packages/textshot/textshot.py", line 11, in <module> from .logger import log_copied, log_ocr_failure ImportError: attempted relative import with no known parent package [1] 82030 segmentation fault (core dumped) python textshot.py -h

ERROR: Unable to read text from image, did not copy

On recent Arch linux with i3wm window manager I often get ERROR: Unable to read text from image, did not copy.

I have :

python-pyqt5 : 5.15.8
python-pyqt5-sip: 12.11.1
python-pillow: 9.4.0
python-pytesseract: 0.3.10

I'm too dumb and autistic to make this thing work pls help

Hello the issue is my brain, I can't make it work pls help.
Basically, I can make textshot work via cmd but am too dumb to understand the greatness of your coding skill and btw how autokhey works.
Pls help

screenshot "E:\>cd github" output -->"INFO: Copied "AR" to the clipboard"

screenshot "E:>cd github" output -->"INFO: Copied "AR" to the clipboard"

the command line test info:

E:\github>cd..

E:>cd github

E:\github>cd textshot

E:\github\textshot>python textshot.py
**INFO: Copied "E:\github>cd. .
AR

E:\github>cd textshot" to the clipboard**

E:\github\textshot>python textshot.py chi_sim
**INFO: Copied "ET
E:N>cd github

E:Ngithub>cd textshot" to the clipboard**

E:\github\textshot>

can't work when using multi screen

I use two monitors , this program can't work

Does this repo support Chinese?

Doesn't work in multi-monitor setup

I have two screens (let's name them Main and Side). When I open type textshot in a terminal in Side, the Main monitor starts mirroring Side monitor's content.

So, to copy text from Main, I have to open the terminal in Main. This is not a good experience

License

Please add a license file.

Consider adding pyproject.toml for package installation

Here is a pyproject.toml example

[build-system]
requires = ["hatchling"]
build-backend = "hatchling.build"

[project]
name = "textshot"
version = "0.0.1"
authors = [
  { name="Ian ianzhao05", email="[email protected]" },
]
description = "Python tool for grabbing text via screenshot"
readme = "README.md"
requires-python = ">=3.7"
classifiers = [
    "Programming Language :: Python :: 3",
    "License :: OSI Approved :: MIT License",
    "Operating System :: OS Independent",
]
dynamic = ["dependencies"]

[project.urls]
"Homepage" = "https://github.com/ianzhao05/textshot"
"Bug Tracker" = "https://github.com/ianzhao05/textshot/issues/"

[project.scripts]
textshot = "textshot.textshot:main"

[tool.setuptools.dynamic]
dependencies = {file = ["requirements.txt"]}

[tool.setuptools.packages.find]
where = ["textshot"]

It may need some tweaking and modifying the project to relocate python files in textshot and changing the way import is done by using something like from .ocr import … for example.

It would be helpful to make a working package for linux distributions.

You then build and install the package with:

python3 -m build --wheel
python -m installer dist/*.whl

You need to install AutoHotkey, which can be found at https://www.autohotkey.com/

Cropping depends on screen resolution

Cropping is incorrect at times. It depends on screen resolution. I have a 4k display and it wasn't performing as expected

how to get facebook information

Error

ERROR: An error occurred when trying to process the image: (1, "Tesseract Open Source OCR Engine v3.05.00dev with Leptonica read_params_file: Can't open txt Warning in pixReadMemPng: work-around: writing to a temp file libpng warning: Application built with libpng-1.4.3 but running with 1.5.14 Error in pixReadStreamPng: png_ptr not made Error in pixReadMemPng: pix not read Error in pixReadMem: png: no pix returned Error during processing.")

Failed when have multiple monitors

Once I unplugged from external monitors it worked well. The problem is raised from line 29 in textshot.py,
self.screen = QtWidgets.QApplication.screenAt(QtGui.QCursor.pos()).grabWindow(0)

My system is MacOS 10.15.3, with python 3.6.9. Thank you.

Windows defender started recognizing executable file from AutoHotkey script as trojan.

Anyone encountered this phenomena?

[Feature suggestion] Add an auto magnification

Sometimes the target is too small on the screen and I can't capture it accurately. Maybe it is a good idea to add a magnified image based on what is around the cursor when users are capturing the screen.

Not considering dpi scaling results in wrong positions of start and end points on Linux

Here is an example.
The positions captured by the program:

start:1227,695
end:1272,715

Their real positions:

start:3681,2085
end:3816,2145

Thus pyscreenshot grabs the wrong image.

My scale factor:

GDK_DPI_SCALE=0.333
GDK_SCALE=3
QT_AUTO_SCREEN_SCALE_FACTOR=0
QT_SCREEN_SCALE_FACTORS=eDP1=3;DP1=3;DP2=3;HDMI1=3;HDMI2=3;VIRTUAL1=3;

能不能集成中文的OCR识别能力呀？

好项目呀，感觉很实用，不过我看现在应该是只有英文，之前chineseocr有17M中文识别模型模型，
还有最近百度飞桨新发的，https://github.com/PaddlePaddle/PaddleOCR 只有9M的模型，效果好像还更好一些，
不知道几位大佬，最近有没有计划把中文识别能力集成进去呀？

Produces a grey screen when called

Everything works except the starting window when textshot is called, which is fully grey.
OS : Manjaro Linux KDE

Text not being copied to clipboard and sometimes words are translated to french language !

I have created a shortcut in my Ubuntu for textshot and whenever I use it (some times not all the time) the text is copied in French and Not being copied to clipboard at all(this is main issue), I knew it was in French because it was shown in notification :/

Hotkey opening failed, but can be performed in CMD

Use the textShot.ahk script that comes with it
Please guide

Windows defender started recognizing executable file from AutoHotkey script as trojan.

could not work on MacOS

on Mac get

INFO: Unable to read text from image, did not copy

seems pyperclip do not work properly on MacOS.

textshot on macOS Big Sur

Hi,

On macOS 11.1, invoking python textshot.py throws a Qt GUI error:

QPixmap::fromImage: QPixmap cannot be created without a QGuiApplication
QPixmap: Must construct a QGuiApplication before a QPixmap

Any suggestions? This worked fine before updating to Big Sur. Thank you!

EDIT: This has been tried with a virtual environment.

Added your package to arch linux aur

Just a friendly heads up that I've added your package to the archlinux aur and it will keep itself updated based on the latest git commits to the github repo.
https://aur.archlinux.org/packages/textshot-git

So for arch users it's as easy as installing textshot-git with their favourite aur helper.
yay -S textshot-git

Also is there a way to make textshot pause the desktop (animations like the gif on this page)?
Currently it keeps on animating while in box select mode.

Thanks for the great tool!

Tesseract Process Timeout

Appears to not work on more than 5 words at a time, presents with error

"TextShot"
"An error occurred when trying to process the image: Tesseract process timeout"

Size limitation for text, and sometimes prior OCR conversion in clipboard is not replaced with new OCR conversion

Hi
I think this is a really cool idea to make OCR simple to do and allows for correcting OCR mistakes very easily.
I am on a Windows machine and I find that I need to OCR a large text image in parts because it doesn't handle
a lot of text well. Is there a recommended maximum amount of text that should be selected for conversion?
But even doing OCR in parts, some areas appear to be captured and the "spinning wheel" indicates that
a conversion is being done. But when pasting the text that is in the clipboard to notepad++, it is the text from a prior conversion.
If there is an error in the conversion process, I can't find where it is displayed. Can you please give me some pointers
on getting around these issues?
Thanks!

Selection not from upper left corner

When I make a selection starting not from the upper left corner, but from any other corner, instead of the text in selection it returns some long random text, which seems to be the text from all screen.

To reproduce, make a selection of some text from, for example, lower right corner to upper left corner.

2022-02-09.11.12.21.mp4

doesn't work in macos

很遗憾

Does not pause desktop while snipping

The screen overlay does not pause the desktop while snipping. For example, videos and GIFs continue to play in the background. This is inconsistent with Windows's screenshot tools, Snipping Tool and the newer Snip & Sketch.

Raised by @rigred in #12

ianzhao05 / textshot Goto Github PK

textshot's People

Contributors

Stargazers

Watchers

Forkers

textshot's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs