Felix Dittrich's Projects
SSD: Single Shot MultiBox Detector | a PyTorch Tutorial to Object Detection
Hierarchical Attention Networks | a PyTorch Tutorial to Text Classification
Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Official implementation for ICDAR 2021 paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer"
Containers as a service on AWS
The code for the book "Deep-Learning-For-Computer-Vision-With-Python"
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
The official code for "Deep Unrestricted Document Image Rectification".
This repository contains utilities for post-processing scanned documents and classifying any kind of document against the given labels
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.
Short introduction about myself
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Handwriting Synthesis with RNNs
PyTorch implementations of recent Computer Vision tricks (ReXNet, RepVGG, Unet3p, YOLOv4, CIoU loss, AdaBelief, PolyLoss, MobileOne)
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Model for document segmentation trained on the midv-500-models dataset.