
Teledash

Research and analysis software for Telegram

Teledash is a web application that simplifies research and analysis of the content on Telegram.

Repositories

Features

One or more Telegram accounts can be linked to Teledash. The content will be downloaded and processed periodically.

Search

With the help of the web interface, all channels, groups and chats can be searched with various parameters. For example, you can search for messages from a certain period in certain channels or messages from specific users.
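As an illustration only, a search that combines several parameters can be thought of as one filter document. The sketch below assumes messages live in a MongoDB collection (the README mentions mongoexport); the function name and the field names `chat_id`, `from_user_id`, `date` and the use of a text index are hypothetical and not taken from the Teledash code base.

```python
from datetime import datetime, timezone
from typing import Any, Optional, Sequence


def build_message_filter(
    chat_ids: Optional[Sequence[int]] = None,
    user_ids: Optional[Sequence[int]] = None,
    start: Optional[datetime] = None,
    end: Optional[datetime] = None,
    text: Optional[str] = None,
) -> dict[str, Any]:
    """Combine optional search parameters into one MongoDB-style filter.

    All field names are assumptions made for this sketch.
    """
    query: dict[str, Any] = {}
    if chat_ids:
        # Restrict the search to certain channels or groups.
        query["chat_id"] = {"$in": list(chat_ids)}
    if user_ids:
        # Restrict the search to messages from specific users.
        query["from_user_id"] = {"$in": list(user_ids)}
    date_range: dict[str, datetime] = {}
    if start is not None:
        date_range["$gte"] = start  # inclusive lower bound
    if end is not None:
        date_range["$lt"] = end  # exclusive upper bound
    if date_range:
        query["date"] = date_range
    if text:
        # $text only works if the collection has a text index.
        query["$text"] = {"$search": text}
    return query


# Messages from January 2022 in two (made-up) channels.
example = build_message_filter(
    chat_ids=[1001, 1002],
    start=datetime(2022, 1, 1, tzinfo=timezone.utc),
    end=datetime(2022, 2, 1, tzinfo=timezone.utc),
)
```

Parameters that are left out simply do not appear in the filter, so the same helper covers both narrow and broad searches.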

Text and speech recognition

Automated text recognition (OCR) is used to recognize and save text on images. In addition, voice messages can automatically be transcribed (ASR) in the background and stored as searchable text. The quality of the results depends strongly on the quality of the audio recording as well as the speech model used. Models for text as well as speech recognition can be manually improved or trained if necessary.

Metrics

Teledash regularly collects statistical data on the activity and growth of channels and groups, enabling quantitative analysis.
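To make "quantitative analysis" concrete, here is a minimal sketch of one metric that could be derived from periodic snapshots: the relative growth of a channel's member count. The snapshot format (date, member count) and the function name are assumptions for illustration; the README does not specify how Teledash stores or computes its metrics.

```python
from datetime import date


def growth_rate(snapshots: list[tuple[date, int]]) -> float:
    """Relative change in member count between the earliest and latest snapshot."""
    if len(snapshots) < 2:
        raise ValueError("need at least two snapshots")
    ordered = sorted(snapshots)  # sort by date so input order does not matter
    first, last = ordered[0][1], ordered[-1][1]
    if first == 0:
        raise ValueError("earliest snapshot has zero members")
    return (last - first) / first


# Made-up weekly snapshots of one channel.
snapshots = [
    (date(2022, 1, 1), 1000),
    (date(2022, 1, 8), 1100),
    (date(2022, 1, 15), 1250),
]
rate = growth_rate(snapshots)
print(f"{rate:.0%}")  # 25%
```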

Storage

Media such as videos, photos, and voice messages can be automatically downloaded and stored in a MinIO instance or in S3-compatible cloud storage.

Export

All collected data can also be accessed and filtered via a REST API, so that third-party software can process the content further. API endpoints can be tested using Swagger. Optionally, complete data sets can be exported as CSV or JSON with mongoexport.
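The two export paths can be sketched as follows. Everything specific here is an assumption for illustration: the host and port, the `messages` resource, the query parameter names, and the database and collection names are hypothetical, so consult the Swagger page of your own instance for the real endpoints. The mongoexport options used (`--uri`, `--collection`, `--type`, `--fields`, `--out`) are standard options of that tool.

```python
import shlex
from typing import Optional, Sequence
from urllib.parse import urlencode


def api_url(base_url: str, resource: str, **params: object) -> str:
    """Build a filtered REST URL; parameters set to None are skipped."""
    query = urlencode({k: v for k, v in params.items() if v is not None})
    url = f"{base_url.rstrip('/')}/{resource}"
    return f"{url}?{query}" if query else url


def mongoexport_args(
    uri: str,
    collection: str,
    out_path: str,
    fields: Optional[Sequence[str]] = None,
) -> list[str]:
    """Build a mongoexport command line.

    CSV export needs an explicit field list, so the sketch switches
    to CSV only when fields are given and uses JSON otherwise.
    """
    fmt = "csv" if fields else "json"
    args = [
        "mongoexport",
        f"--uri={uri}",  # connection string including the database name
        f"--collection={collection}",
        f"--type={fmt}",
        f"--out={out_path}",
    ]
    if fields:
        args.append("--fields=" + ",".join(fields))
    return args


# Hypothetical endpoint and parameter names.
url = api_url("http://localhost:8000/", "messages", chat_id=1001, limit=50)

# Hypothetical database ("teledash") and collection ("messages").
args = mongoexport_args(
    "mongodb://localhost:27017/teledash",
    "messages",
    "messages.csv",
    fields=["_id", "date", "text"],
)
print(url)
print(shlex.join(args))
```

Building the command as a list keeps it safe to pass to `subprocess.run` without shell quoting issues; `shlex.join` is only used to display it.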

Future development

Teledash will be further developed and tested in journalistic and scientific contexts. Feel free to get in touch.

Terminology

  • Chats are groups, supergroups and channels. (Private conversations won't be scraped or stored by Teledash.)
  • Users are users and bots.
  • Messages are messages that contain text or media (attachments).

Citation

Please cite Teledash in your publications if you use it in your research:

@misc{teledash_2022, 
  title={Teledash – analysis and research software for Telegram}, 
  url={https://github.com/democ-de/teledash}, 
  author={Weichbrodt, Gregor and Stanjek, Grischa}, 
  year={2022}
} 

Acknowledgements

  • Funded from September 2021 until February 2022 by the Bundesministerium für Bildung und Forschung (BMBF), Prototype Fund and OKF-Deutschland.
