GithubHelp home page GithubHelp logo

sleepingsaint / xwindows Goto Github PK

View Code? Open in Web Editor NEW
4.0 1.0 0.0 21 KB

Linux python client for Xserver to access windows information and automate keypresses

License: MIT License

Python 100.00%
xlib xserver linux automation

xwindows's Introduction

XWindows

Linux python client for Xserver to access windows information and automate keypresses

Table of contents


About the package

I created this package mainly to solve some problems I faced using opencv on linux.

Problems


  • Obtaining Window screen in fast and efficient way
    • Every time we want to capture screen we have to take screen shot with 3rd party packages like pyautogui and then convert into opencv image which is both slow and inefficient.
  • No modules like pywin32 for linux to access internal apis

Solutions


  • This package provides native Xserver interaction from python using python-xlib package.
  • This package provides the following features
    • Get active windows (active means all the visual application windows)
    • Get geometry of a particular window (height, width, x, y coordinates)
    • Get screenshot of a particular window in raw bytes or PIL image format
    • Automate keypresses

Since all these operations are done using native apis you will experience increase in performance

Resources

  • native xlib documentation
    • This documentation is written for C / C++ language and you need to have a bit of knowledge to understand this documentation.
    • No availability of lot of examples makes it even harder for beginners to understand the documentation.
  • python-xlib documentation
    • The documentation is still not completed but the devs are updating it continuosly.
  • My Notes
    • I have noted down some important topics to understand XServer and xlib library documentation. Hope it helps :)

Documentation

Installation

pip install XWindows

If you use python3, then use following command

pip3 install XWindows

Importing

import XWindows

# If you want to import seperate components

from XWindows import windows
from XWindows import keys

# If you want to import seperate components and functionality
# This way of importing is preffered cause you don't have to add whole namespace again and again
# This will be used to explain the features further

from XWindows.windows import Windows
from XWindows.keys import Keys

Initialising

# intialising windows object
windows = Windows()

# dispose windows object
windows.dispose()

Methods

Get Active windows

  • Here active indicates the visual application windows
activeWindows = windows.getActiveWindows()
print(activeWindows)

# The result will be list of tuples of (window_id, window_title)
# [
#   (29360135, 'firstPy'), 
#   (33554442, 'python3'), 
#   (39845895, b'windows.py - firstPy - Visual Studio Code'), 
#   (39845920, b'test.py - XWindows - Visual Studio Code'), 
#   (52428803, b'Linux get Window (python) \xe2\x80\x94 Mozilla Firefox')
# ]

# These window_id's change everytime you close and open a particular window

Get Window geometry

# Syntax windows.getWindowGeometry(window_id)

geometry = windows.getWindowGeometry(29360135)
print(geometry)

# returns a tuple of (x, y, width, height)
# (302, 141, 1375, 691)

Get Window Screen

# syntax: windows.getWindowScreen(window_id, bytes_format=False)
# syntax: windows.getWindowScreen(window_id, bytes_format=True) for raw data

# returns image in PIL Image format
screen = windows.getWindowScreen(29360135)

# returns tuple of (image in bytes, image width, image height)
screen_bytes_format = windows.getWindowScreen(29360135, bytes_format=True)

Get Window Slice

  • Use this to get certain portion of a screen
# syntax: windows.getWindowSlice(window_id, x, y, width, height)
# syntax: windows.getWindowSlice(window_id, x, y, width, height, bytes_format=True)

window_id = 29360135
(x, y, width, height) = windows.getWindowGeometry(window_id)

# returns data in PIL Image format
window_slice = windows.getWindowSlice(window_id, x, y, width, height)

# return image data in bytes format
window_slice_bytes_format = windows.getWindowSlice(window_id, x, y, width, height)

Press Key

# syntax: windows.pressKey([keysyms,], window_id)
# all keysyms are accessible using windows.keys

# this below sends A keys pressed signal to window with id window_id
windows.pressKey([windows.keys.A], window_id)
  • [keysyms,] - list of keys you want to simulate
  • send the keys in the order you would press in keyboard
  • for example [windows.keys.A,] = [windows.keys.Shift_L, windows.keys.a]

Author

sleepingsaint @github

Contributions or PR's of any kind are appreciated

Features Planned to add

[ ] Automate mouse movements
[ ] Get windows with title rather than window_id

xwindows's People

Contributors

sleepingsaint avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.