
Hi! 👋 I'm FrankZxShen

🚀 About me

I am a student majoring in artificial intelligence at Southwest Jiaotong University.

ZxShen's Projects

atla-demo

Source code for "Adversarial Training for Layout-Aware Text-VQA".

ats

[ICME 2024] The code for Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering

echarts

Apache ECharts is a powerful, interactive charting and data visualization library for the browser.

efficientzero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" (NeurIPS 2021), with the residual module optimized.

grasscutter

A server software reimplementation for a certain anime game.

latla

The LLM portion of ATLA, used to bring external knowledge from Llama 2 into Text-VQA.

latr

Implementation of LaTr: Layout-aware Transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answering (STVQA).

mnlexnet

A PyTorch repository for recognition on the MNIST dataset.

tap

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral). Adds prompts for an LLM.

terminal

The new Windows Terminal and the original Windows console host, all in the same place!

vits-fast-fine-tuning

A pipeline for VITS fine-tuning, supporting fast speaker-adaptation TTS and many-to-many voice conversion.
