awesome-ai
A curated list of AI tools, courses, books, and resources for anyone interested in exploring artificial intelligence, machine learning, and deep learning.
https://github.com/nickvasdev/awesome-ai
Last synced: 11 days ago
JSON representation
-
Books
-
Advanced AI and Machine Learning Courses
- The Hundred-Page Machine Learning Book - A comprehensive guide to machine learning in just 100 pages.
- Generative AI in Action - Learn how to add generative AI tools for text, images, and code into projects.
-
-
Courses
-
Advanced AI and Machine Learning Courses
- Knowledge Based Artificial Intelligence - Georgia Tech's course focusing on Symbolic AI.
- Deep RL Bootcamp Lectures - Deep Reinforcement Bootcamp Lectures - August 2017.
- Elements of AI - An introduction to AI for everyone interested in learning what AI is and how it affects our lives.
-
Deep Learning and Reinforcement Learning
- Deep Learning - Introductory course to deep learning using TensorFlow.
- Stanford Statistical Learning - Introductory course on machine learning.
-
Introductory Courses
- Introduction to Artificial Intelligence (AI) - High-level introduction to AI from IBM on Coursera.
- Introduction to Generative AI - Beginner-level introduction to Generative AI from Google on Coursera.
-
-
Learning Resources
-
Awesome GitHub Resources
- Awesome Graph Classification - Learning from graph-structured data.
- Awesome Fraud Detection Papers - Fraud detection papers from machine learning conferences.
-
Online Resources
- Awesome Machine Learning - Curated list of machine learning resources.
- Awesome Deep Learning Resources - Rough list of deep learning resources.
- Professional and In-Depth AI Video Courses - Free professional AI tutorials and courses.
- Harvard: Machine Learning Systems
- Deep Learning AI: Introduction to On-Device AI
-
-
Newsletters
-
Awesome GitHub Resources
- Superhuman.ai - A daily AI newsletter.
-
-
On-Device AI
-
Efficient Architectures for On-Device LLMs
- MobileLLM - High accuracy, optimized for sub-billion parameter models, embedding sharing, grouped-query attention, reduced model size.
- EdgeShard - Up to 50% latency reduction, collaborative edge-cloud computing, optimal shard placement, distributed model components reduce individual device load.
- LLMCad - Up to 9.3× speedup in token generation, generate-then-verify, token tree generation, smaller LLM for token generation, larger LLM for verification.
- Any-Precision LLM - Supports multiple precisions efficiently, post-training quantization, memory-efficient design, substantial memory savings with versatile model precisions.
- Breakthrough Memory - Up to 4.5× performance improvement, PIM and PNM technologies enhance memory processing, enhanced memory bandwidth and capacity.
- MELTing Point - Provides systematic performance evaluation, analyzes impacts of quantization, efficient model evaluation, evaluates memory and computational efficiency trade-offs.
- LLMaaS on device - Reduces context switching latency significantly, stateful execution, fine-grained KV cache compression, efficient memory management with tolerance-aware compression and swapping.
- LocMoE - Reduces training time per epoch by up to 22.24%, orthogonal gating weights, locality-based expert regularization, minimizes communication overhead with group-wise All-to-All and recompute pipeline.
- EdgeMoE - Significant performance improvements on edge devices, expert-wise bitwidth adaptation, preloading experts, efficient memory management through expert-by-expert computation reordering.
- JetMoE - Outperforms Llama27B and 13B-Chat with fewer parameters, reduces inference computation by 70% using sparse activation, 8B total parameters, only 2B activated per input token.
- Pangu-$\pi$ Pro - Neural architecture, parameter initialization, and optimization strategy for billion-level parameter models, embedding sharing, tokenizer compression, reduced model size via architecture tweaking.
- Zamba2 - 2x faster time-to-first-token, a 27% reduction in memory overhead, and a 1.29x lower generation latency compared to Phi3-3.8B, hybrid Mamba2/Attention architecture and shared transformer block, 2.7B parameters, fewer KV-states due to reduced attention.
-
Evolution of On-Device LLMs
- Tinyllama - Open-source small language model.
- MobileVLM V2 - Faster and stronger baseline for Vision Language Model.
- MobileAIBench - Benchmarking LLMs and LMMs for on-device use cases.
- Octopus series papers - On-device language models for different applications. [[Octopus v2]](https://arxiv.org/abs/2404.01744) [[Octopus v3]](https://arxiv.org/abs/2404.11459) [[Octopus v4]](https://arxiv.org/abs/2404.19296) [[Github]](https://github.com/NexaAI).
- The Era of 1-bit LLMs - All large language models are in 1.58 bits.
- AWQ - Activation-aware weight quantization for LLM compression and acceleration. [[Github]](https://github.com/mit-han-lab/llm-awq).
- Small Language Models - Survey, measurements, and insights.
-
General Efficiency and Performance Improvements
- Any-Precision LLM - Low-cost deployment of multiple, different-sized LLMs. [[Github]](https://github.com/SNU-ARC/any-precision-llm).
- On the Viability of Using LLMs for SW/HW Co-design - An example in designing CIM DNN accelerators.
-
Limitations of Cloud-Based LLM Inference and Advantages of On-Device Inference
- Ferret-v2 - An improved baseline for referring and grounding with large language models.
- Phi-3 Technical Report - A highly capable language model locally on your phone.
- Exploring post-training quantization - Comprehensive study to low rank compensation.
- Matrix compression - Randomized low rank and low precision factorization. [[Github]](https://github.com/pilancilab/matrix-compressor).
-
LLM Architecture Foundations
- The case for 4-bit precision - k-bit inference scaling laws.
- Challenges and applications of large language models
- MiniLLM - Knowledge distillation of large language models. [[Github]](https://github.com/Tebmer/Awesome-Knowledge-Distillation-of-LLMs).
- Gptq - Accurate post-training quantization for generative pre-trained transformers. [[Github]](https://github.com/IST-DASLab/gptq).
- Gpt3.int8() - 8-bit matrix multiplication for transformers at scale.
-
Memory and Computational Efficiency
- Breakthrough Memory Solutions - Improved performance on LLM inference.
-
On-Device LLMs Training
- OpenELM - An efficient language model family with open training and inference framework. [[Github]](https://github.com/apple/corenet).
-
The Performance Indicator of On-Device LLMs
- MNN - A lightweight deep neural network inference engine.
- PowerInfer-2 - Fast large language model inference on a smartphone. [[Github]](https://github.com/SJTU-IPADS/PowerInfer).
- llama.cpp - Lightweight library for approximate nearest neighbors and maximum inner product search.
- Powerinfer - Fast large language model serving with a consumer-grade GPU. [[Github]](https://github.com/SJTU-IPADS/PowerInfer).
-
-
Tools
-
Chat
-
Commercial Tools
- Taskade - Build, train, and deploy AI agents to automate tasks, research, and collaborate in real-time.
-
Images
- Midjourney - AI image generation.
- DALL·E 3 - AI system that creates realistic images and art from a natural-language description.
-
Video
-
-
Videos
-
Advanced AI and Machine Learning Courses
- The Unreasonable Effectiveness Of Deep Learning - Yann LeCun gives a talk on deep convolutional neural networks.
- AWS Machine Learning in Motion - Learn how to build a predictive algorithm using AWS.
- Reinforcement Learning in Motion - Concepts like how RL systems learn and how to train AI agents.
-
Sub Categories
Efficient Architectures for On-Device LLMs
12
Advanced AI and Machine Learning Courses
8
Evolution of On-Device LLMs
7
Online Resources
5
LLM Architecture Foundations
5
The Performance Indicator of On-Device LLMs
4
Limitations of Cloud-Based LLM Inference and Advantages of On-Device Inference
4
Awesome GitHub Resources
3
Introductory Courses
2
Deep Learning and Reinforcement Learning
2
Video
2
General Efficiency and Performance Improvements
2
Chat
2
Images
2
Memory and Computational Efficiency
1
Commercial Tools
1
On-Device LLMs Training
1
Keywords
deep-learning
3
machine-learning
2
graph-classification
2
graph-representation-learning
1
graph-kernels
1
graph-kernel
1
graph-embedding
1
graph-convolutional-networks
1
graph-attention-networks
1
graph-attention-model
1
deepwalk
1
deep-graph-kernels
1
classification-algorithm
1
attention-mechanism
1
winograd-algorithm
1
vulkan
1
mnn
1
ml
1
embedded-devices
1
deep-neural-networks
1
convolution
1
arm
1
tensorflow
1
lstm
1
cnn
1
awesome-list
1
random-forest
1
logistic-regression
1
link-prediction
1
gradient-boosting
1
fraud-prevention
1
fraud-management
1
fraud-explorer
1
fraud-detection
1
fraud-checker
1
data-science
1
data-mining
1
credit-scoring
1
credit-card-validation
1
credit-card-fraud-detection
1
credit-card-fraud
1
classifier
1
classification
1
churn
1
weisfeiler-lehman
1
structural-attention
1
node2vec
1
node-embedding
1
network-embedding
1
netlsd
1