Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/myousefi/bitter-lesson-cvpr
https://github.com/myousefi/bitter-lesson-cvpr
Last synced: 7 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/myousefi/bitter-lesson-cvpr
- Owner: myousefi
- Created: 2024-06-17T18:13:48.000Z (7 months ago)
- Default Branch: master
- Last Pushed: 2024-12-23T03:30:15.000Z (13 days ago)
- Last Synced: 2024-12-23T04:27:37.100Z (13 days ago)
- Language: Python
- Size: 85.9 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# bitter-lesson-cvpr
A data analysis project to evaluate computer vision papers through the lens of Richard Sutton's "Bitter Lesson" principles using LLMs.
## Project Overview
This project analyzes CVPR (Computer Vision and Pattern Recognition) papers to assess their alignment with the principles outlined in Richard Sutton's "The Bitter Lesson". It uses LLM-based evaluation to score papers across multiple dimensions related to scalable machine learning approaches versus human-engineered solutions.
## Key Features
- Paper data collection via Semantic Scholar API
- SQLite database storage for papers and evaluation scores
- Multiple LLM evaluation implementations (GPT-4, Claude)
- Scoring system across multiple bitter lesson dimensions:
- Learning over engineering
- Search over heuristics
- Scalability with computation
- Generality over specificity
- Favoring fundamental principles## Project Structure
bitter-lesson-cvpr/
├── src/
│ └── bitter_lesson_cvpr/
│ ├── llm_evaluation/ # LLM scoring implementations
│ ├── semantic_scholar_api/ # Paper data collection
│ └── utils/ # Helper utilities
├── notebooks/ # Analysis notebooks
├── dbs/ # SQLite database storage## Setup
1. Clone the repository
2. Install dependencies:pip install -r requirements.txt
3. Set up environment variables:
- `MAGENTIC_ANTHROPIC_API_KEY` - Anthropic API key
- `S2_API_KEY` - Semantic Scholar API key
- `OUTPUT_DIR` - Directory for output files## Usage
Collect paper data:
python -m bitter_lesson_cvpr.semantic_scholar_api_call
Run LLM evaluations:
python -m bitter_lesson_cvpr.llm_evaluation.evaluate_abstracts_v2
Generate analysis plots:
python notebooks/plots.py
## Database Schema
The project uses two main tables:
- `papers`: Stores paper metadata and abstracts
- `bitter_lesson_scores_v2`: Stores LLM evaluation scores across different dimensions## Analysis & Visualization
The `notebooks` directory contains Plotly-based visualizations for:
- Temporal trends in bitter lesson alignment
- Score distributions across different dimensions
- Comparative analysis between different LLM evaluators## Citation
@inproceedings{yousefi-collins-2024-learning,
title = "Learning the Bitter Lesson: Empirical Evidence from 20 Years of {CVPR} Proceedings",
author = "Yousefi, Mojtaba and Collins, Jack",
editor = "Peled-Cohen, Lotem and Nitay, Calderon and Lissak, Shir and Reichart, Roi",
booktitle = "Proceedings of the 1st Workshop on NLP for Science (NLP4Science)",
month = nov,
year = "2024",
address = "Miami, FL, USA",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.nlp4science-1.15",
doi = "10.18653/v1/2024.nlp4science-1.15",
pages = "175--187",
abstract = "This study examines the alignment of Conference on Computer Vision and Pattern Recognition (CVPR) research with the principles of the {``}bitter lesson{''} proposed by Rich Sutton. We analyze two decades of CVPR abstracts and titles using large language models (LLMs) to assess the field{'}s embracement of these principles. Our methodology leverages state-of-the-art natural language processing techniques to systematically evaluate the evolution of research approaches in computer vision. The results reveal significant trends in the adoption of general-purpose learning algorithms and the utilization of increased computational resources. We discuss the implications of these findings for the future direction of computer vision research and its potential impact on broader artificial intelligence development. This work contributes to the ongoing dialogue about the most effective strategies for advancing machine learning and computer vision, offering insights that may guide future research priorities and methodologies in the field.",
}