As a data journalist, I focus on data-driven investigations that expose abuses of power. My work includes scraping and cleaning data, creating data memos, conducting research and fostering understanding of data work within the team.
Repository |
Description |
discursos-milei |
Scraper y análisis de discursos de Javier Milei |
ai4foia |
Proof-of-concept to recommend recipients for FOIA requests |
hackathon-somos-nlp-2023 |
Fine-tuning LLMs for detecting hate speech categories in Spanish |
customized-headlines |
Proof-of-concept to create customized headlines from news content based on demographic data |
explained-recommendations |
API for a system recommendation explained using generative AI |
opportunities-db |
Scraper to extract data from opportunity-related websites (e.g. funds, scholarships, etc.) and convert them into structured data |
ner-spanish |
A repository for extracting Named Entity Recognition (NER) in Spanish data |
pmdm |
Fine-tuned pre-trained language model that detects hate speech against women in Spanish and Portuguese |
attackdetector |
Research for hate speech on Twitter against journalists and environmental activists in Mexico and Brazil |
topicos-discursos-amlo |
Analysis with topic modeling to AMLO's speeches |
bad-bunny |
Analysis of Bad Bunny's songs |
Repository |
Description |
ping-pong-caba |
Mapa con ubicaciones de mesas de ping pong en lugares públicos de CABA |
comision-revision-bolivia |
Map showing the rate of femicides in Bolivia per 100,000 women from 2013 to 2020 |
escritoras-latinas |
Web scraping of Wikipedia entries for Latin American women writers and network graph visualization |
wifi-gratuito-cdmx |
Map showing locations of public free internet service in Mexico City [ARCHIVED] |
mapa-huertos |
Map with locations of urban orchards in Mexico City [ARCHIVED] |
maps-examples |
Maps examples using folium and prettymaps modules in Python [ARCHIVED] |
directorix-disidente |
Digital directory of professions to build networks among the queer community of Mexico City [ARCHIVED] |
Repository |
Description |
cij-argentina |
Scraper to convert PDF files from the CIJ website in Argentina into structured data |
pdf-2-ner |
Web application to convert scanned PDF files to text-based data and apply Named Entity Recognition (NER) to extract entities in Spanish |
Repository |
Description |
csvconf-nlp |
Sesión de introducción a NLP en la csv,conf,v8 de Puebla, México en 2024 |
taller-cookiecutter |
Taller sobre cómo crear plantillas de proyectos para análisis de datos |
taller-python |
Jupyter notebooks for learning the basics of Python |
learn-python |
Collection of Python scripts organized by topics |
learn-react-d3 |
Examples for data visualization with React and D3.js |
learn-scrollama |
Examples for scrollytelling with scrollama |
twitter-python |
Examples for Twitter data collection with Tweepy in Python [ARCHIVED] |