https://github.com/fer-aguirre/fer-aguirre

My GitHub profile README

## Hi there! 👋

I work at the intersection of social data science and journalism to tell stories with the help of data.

### About me

- 💻 Data analysis at [The Examination](https://www.theexamination.org/)
- 🚀 Computational linguistics enthusiast
- 📌 Always interested in collaborating on data-driven projects
- 📫 How to reach me: feraguirre@riseup.net

---

### Contents

- [NLP](#nlp)
- [Data Analysis](#data-analysis)
- [Data Visualization](#data-visualization)
- [Web Scraping](#web-scraping)
- [Tools](#tools)
- [Project Templates](#project-templates)
- [Learning Resources](#learning-resources)

---

### NLP

| Repository | Description |
| --- | --- |
| [discursos-milei](https://github.com/fer-aguirre/discursos_milei) | Scraper and analysis of Javier Milei's speeches |
| [ai4foia](https://github.com/fer-aguirre/ai4foia) | Proof-of-concept to recommend recipients for FOIA requests |
| [hackathon-somos-nlp-2023](https://github.com/fer-aguirre/hackathon-somos-nlp-2023) | Fine-tuning LLMs for detecting hate speech categories in Spanish |
| [customized-headlines](https://github.com/fer-aguirre/customized-headlines) | Proof-of-concept to create customized headlines from news content based on demographic data |
| [explained-recommendations](https://github.com/fer-aguirre/explained-recommendations) | API for a recommendation system whose results are explained using generative AI |
| [opportunities-db](https://github.com/fer-aguirre/opportunities-db) | Scraper to extract data from opportunity-related websites (e.g. funds, scholarships, etc.) and convert them into structured data |
| [ner-spanish](https://github.com/fer-aguirre/ner_spanish) | A repository for extracting named entities with Named Entity Recognition (NER) from Spanish data |
| [pmdm](https://github.com/fer-aguirre/pmdm) | Fine-tuned pre-trained language model that detects hate speech against women in Spanish and Portuguese |
| [attackdetector](https://github.com/JournalismAI/attackdetector) | Research on hate speech on Twitter against journalists and environmental activists in Mexico and Brazil |
| [topicos-discursos-amlo](https://github.com/fer-aguirre/topicos-discursos-amlo) | Topic modeling analysis of AMLO's speeches |
| [bad-bunny](https://github.com/fer-aguirre/bad-bunny) | Analysis of Bad Bunny's songs |

### Data Analysis

| Repository | Description |
| --- | --- |
| [travesticidios-argentina](https://github.com/DataCritica/travesticidios-argentina) | Data analysis of court decisions on transvesticides in Argentina from 2018 to 2023 |
| [elecciones-argentina-2023](https://github.com/DataCritica/elecciones-argentina-2023) | Data analysis of attacks against journalists on Twitter during the 2023 elections in Argentina |
| [recomendaciones-escritoras](https://github.com/fer-aguirre/recomendaciones-escritoras) | Recommendation system for Latin American women writers |
| [cancilleria-colombia](https://github.com/fer-aguirre/cancilleria-colombia) | Data analysis of public servants at the Ministry of Foreign Affairs in Colombia |
| [gptzero-ai-articles](https://github.com/fer-aguirre/gptzero-ai-articles) | Data analysis of articles about ChatGPT that were created with generative AI models |
| [capir-transfronteriza2-2023](https://github.com/DataCritica/capir-transfronteriza2-2023) | Data analysis and topic modeling of anti-rights groups from Brazil, Ecuador and Colombia |
| [migrantes-desaparecidos-eeuu](https://github.com/fer-aguirre/migrantes-desaparecidos-eeuu) | Data analysis of missing migrants en route to the U.S. |
| [covid19-venezuela](https://github.com/fer-aguirre/covid19-venezuela) | Data analysis of COVID-19 deaths in Venezuela |
| [violencia-obstetrica-cuba](https://github.com/DataCritica/violencia-obstetrica-cuba) | Data analysis of obstetric violence in Cuba |

### Data Visualization

| Repository | Description |
| --- | --- |
| [ping-pong-caba](https://github.com/fer-aguirre/ping-pong-caba) | Map showing the locations of ping pong tables in public spaces in CABA |
| [comision-revision-bolivia](https://github.com/DataCritica/comision-revision-bolivia) | Map showing the rate of femicides in Bolivia per 100,000 women from 2013 to 2020 |
| [escritoras-latinas](https://github.com/DataCritica/escritoras-latinas) | Web scraping of Wikipedia entries for Latin American women writers and network graph visualization |
| [wifi-gratuito-cdmx](https://github.com/fer-aguirre/wifi-gratuito-cdmx) | Map showing locations of public free internet service in Mexico City [ARCHIVED] |
| [mapa-huertos](https://mapa-huertos.vercel.app/) | Map showing the locations of urban gardens in Mexico City [ARCHIVED] |
| [maps-examples](https://github.com/fer-aguirre/maps-examples) | Map examples using the folium and prettymaps modules in Python [ARCHIVED] |
| [directorix-disidente](https://github.com/fer-aguirre/directorix-disidente) | Digital directory of professions to build networks among the queer community of Mexico City [ARCHIVED] |

### Web Scraping

| Repository | Description |
| --- | --- |
| [cij-argentina](https://github.com/fer-aguirre/cij-argentina) | Scraper to convert PDF files from the CIJ website in Argentina into structured data |
| [pdf-2-ner](https://github.com/fer-aguirre/pdf-2-ner) | Web application to convert scanned PDF files to text-based data and apply Named Entity Recognition (NER) to extract entities in Spanish |

### Tools

| Repository | Description |
| --- | --- |
| [pubmed-scraper](https://github.com/fer-aguirre/pubmed-scraper) | A Python command-line tool that scrapes PubMed based on keyword search and URL extraction |
| [oportunidades-perioidstas-latam](https://github.com/fer-aguirre/oportunidades-perioidstas-latam) | Website for sharing opportunities for journalists in Latin America |
| [numerical-expressions](https://github.com/fer-aguirre/numerical-expressions) | A Python command-line tool that describes the change between two numerical values |
| [data-annotator](https://github.com/fer-aguirre/data-annotator) | Web application for text-based data labeling [ARCHIVED] |

### Project Templates

| Repository | Description |
| --- | --- |
| [cookiecutter-data-analysis-extensive](https://github.com/fer-aguirre/cookiecutter-data-analysis-extensive) | A cookiecutter template for data analysis projects using Python |
| [cookiecutter-data-analysis-lite](https://github.com/fer-aguirre/cookiecutter-data-analysis-lite) | A starter template for data analysis projects that offers a simplified and beginner-friendly structure |
| [cookiecutter-data-journalism](https://github.com/DataCritica/cookiecutter-data-journalism) | A cookiecutter template for data journalism projects using Python |

### Learning Resources

| Repository | Description |
| --- | --- |
| [csvconf-nlp](https://github.com/fer-aguirre/csvconf-nlp) | Introductory NLP session at csv,conf,v8 in Puebla, Mexico in 2024 |
| [taller-cookiecutter](https://github.com/fer-aguirre/taller-cookiecutter) | Workshop on how to create project templates for data analysis |
| [taller-python](https://github.com/fer-aguirre/taller-python) | Jupyter notebooks for learning the basics of Python |
| [learn-python](https://github.com/fer-aguirre/learn-python) | Collection of Python scripts organized by topics |
| [learn-react-d3](https://github.com/fer-aguirre/learn-react-d3) | Examples for data visualization with React and D3.js |
| [learn-scrollama](https://github.com/fer-aguirre/learn-scrollama) | Examples for scrollytelling with scrollama |
| [twitter-python](https://github.com/fer-aguirre/twitter-python) | Examples for Twitter data collection with Tweepy in Python [ARCHIVED] |