vesuvius-first-letters's Introduction

Vesuvius First Letters

This repository contains the second-place solution to the Vesuvius First Letters challenge.

The approach uses an I3D architecture to detect ink within the scrolls.

How to use

Installation

The requirements.txt file is a snapshot of my environment. I believe only the following packages need to be installed:

pytorch-lightning   
typed-argument-parser   
segmentation_models_pytorch   
albumentations   
warmup_scheduler
wandb  
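
To confirm that the packages listed above are available before training, a small check like the following may help. It is only a sketch: the mapping from pip package names to importable module names is an assumption (for example, typed-argument-parser is imported as tap), and the helper name missing_packages is hypothetical, not part of this repository.

```python
import importlib.util

# Assumed mapping from pip package name to importable module name.
REQUIRED = {
    "pytorch-lightning": "pytorch_lightning",
    "typed-argument-parser": "tap",
    "segmentation_models_pytorch": "segmentation_models_pytorch",
    "albumentations": "albumentations",
    "warmup_scheduler": "warmup_scheduler",
    "wandb": "wandb",
}


def missing_packages(required=REQUIRED):
    """Return the pip names whose module cannot be found in this environment."""
    return [
        pip_name
        for pip_name, module_name in required.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages are importable.")
```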

Training

Download the data and appropriate segments (instructions here).
Unzip the labels folder and place each {segmentid}_inklabel.png in its appropriate segment folder.
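
The label placement step can be scripted. The sketch below assumes a layout that this README does not confirm: an unzipped labels folder containing files named {segmentid}_inklabel.png, and a segments folder with one subfolder per segment id. The function name place_labels and both folder arguments are hypothetical. Labels are copied rather than moved, and labels with no matching segment folder are reported instead of creating new folders.

```python
import re
import shutil
from pathlib import Path

# Matches file names of the form {segmentid}_inklabel.png
LABEL_PATTERN = re.compile(r"^(?P<segment_id>.+)_inklabel\.png$")


def place_labels(labels_dir, segments_dir):
    """Copy each {segmentid}_inklabel.png into segments_dir/{segmentid}/.

    Returns two lists of file names: (placed, skipped).
    A label is skipped when its segment folder does not exist.
    """
    placed, skipped = [], []
    for label in sorted(Path(labels_dir).glob("*_inklabel.png")):
        match = LABEL_PATTERN.match(label.name)
        if match is None:
            # File name has no segment id in front of the suffix.
            skipped.append(label.name)
            continue
        target_dir = Path(segments_dir) / match.group("segment_id")
        if not target_dir.is_dir():
            skipped.append(label.name)
            continue
        shutil.copy2(label, target_dir / label.name)
        placed.append(label.name)
    return placed, skipped
```

Calling place_labels("labels", "segments") and printing the skipped list is a quick way to spot segments that have not been downloaded yet.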

Adjust the CFG class in 64x64_256stride_i3d.py to match your compute. The values there are the typical configs used in the Kaggle competition.

python 64x64_256stride_i3d.py

Inference

The inference script runs a trained model. For more information about the arguments, check the InferenceArgumentParser class in the inference script. A pretrained checkpoint is available here.

e.g.: python inference.py --segment_id 123 --model_path 'model.ckpt'
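
For readers unfamiliar with how the two flags in the example are consumed, here is a minimal sketch of the parsing step. It uses the standard-library argparse rather than the repository's actual InferenceArgumentParser class, so it is a stand-in and not the real implementation. Only the flag names --segment_id and --model_path come from the example above; the string types, the required setting, and the help texts are assumptions.

```python
import argparse


def build_parser():
    """Build a stand-in parser for the two flags shown in the example command."""
    parser = argparse.ArgumentParser(description="Sketch of the inference CLI")
    # Assumption: both flags are strings and both are required.
    parser.add_argument("--segment_id", type=str, required=True,
                        help="id of the segment to run inference on")
    parser.add_argument("--model_path", type=str, required=True,
                        help="path to a trained model checkpoint")
    return parser


# Parse the same arguments as the example command line.
args = build_parser().parse_args(
    ["--segment_id", "123", "--model_path", "model.ckpt"]
)
print(args.segment_id, args.model_path)
```

The real script may accept more arguments with different defaults, so the InferenceArgumentParser class remains the authoritative reference.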

vesuvius-first-letters's Issues

Inferencing

Hi Youssef,

Thank you for providing your code. I tried to regenerate the mask of the segment with ID "20230826135043" using the provided inference script and model checkpoint. I end up with this:
(image: 20230826135043_2_15)

This looks different from what was posted here. Do the default parameters of the inference script match the model checkpoint?

Training

Hi Youssef,

Based on the 64x64_256stride_i3d.py script, it seems that you are training the I3D model without pretrained weights and without using the provided fragments from the Kaggle challenge. Can you please clarify why?
