Useful Links

AI Blogs

The following blogs from leading companies and universities are great resources for staying up to date on the latest AI advancements.

Google AI Blog
Google is known for its constant innovation, and its AI Blog is a great way to follow the company's latest advancements. Posts explain the underlying ideas in a professional manner.
Google Research Publication Database
Academic papers focused on AI from Google Research.
DeepMind Research Blog
DeepMind operates independently within Google and has made significant breakthroughs like AlphaGo and AlphaFold.
Meta AI Blog
Meta's AI publications and blog posts covering various AI research topics.
Microsoft Research AI Publications
High-level AI publications from Microsoft Research.
Stanford AI Lab (SAIL) Blog
Stanford University's AI blog featuring work from notable figures like Andrew Ng and Yoav Shoham.
MIT News on Artificial Intelligence
MIT's AI research news and developments.
MIT Technology Review - AI
AI-focused articles providing a TechCrunch-like perspective on AI developments.

Language Model Papers

22 research papers you should read to master language modeling.

A Mathematical Theory of Communication
Foundational paper in information theory by Claude Shannon.
A Neural Probabilistic Language Model
Early neural language model by Bengio et al.
Natural Language Processing (Almost) from Scratch
Collobert et al.'s unified neural network approach to NLP.
Phoneme Recognition using Time Delay Neural Networks
Early TDNN application to speech recognition.
Efficient Estimation of Word Representations in Vector Space (Word2Vec)
Mikolov et al.'s Word2Vec: CBOW and Skip-gram models.
GloVe: Global Vectors for Word Representation
Global vector approach to word embeddings.
Enriching Word Vectors with Subword Information (FastText)
FastText paper on subword information for word representations.
A Convolutional Neural Network for Modelling Sentences
CNN-based approach to sentence modeling.
Learning Internal Representations by Error Propagation
Classic PDP chapter on backpropagation.
Sequence Modeling (from the deep learning book)
Deep learning book chapter on sequence modeling.
Long Short-Term Memory (LSTM)
Original LSTM paper by Hochreiter and Schmidhuber.
Colah's blog: Understanding LSTM Networks
Excellent visual explanation of LSTM networks.
Training Recurrent Neural Networks (PhD Thesis)
Sutskever's comprehensive PhD thesis on RNN training.
Deep contextualized word representations (ELMo)
ELMo paper on contextualized word representations.
Attention Is All You Need (Transformer)
Foundational Transformer paper.
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT paper introducing bidirectional transformer representations.
Improving Language Understanding by Generative Pre-Training (GPT-1)
Original GPT paper on generative pretraining.
Language Models are Unsupervised Multitask Learners (GPT-2)
GPT-2 paper on unsupervised multitask learning.
Language Models are Few-Shot Learners (GPT-3)
GPT-3 paper on few-shot learning capabilities.
Sentence-BERT
Sentence embeddings using Siamese BERT networks.
ChatGPT Blog
OpenAI's blog post on ChatGPT.
Llama 2
Meta's Llama 2 technical paper.

Python 3 & Development Tools

Google Colab
Colaboratory is a hosted Jupyter notebook environment that is free to use and requires no setup.
Google Colab Tutorial for Beginners
YouTube: Google Colab For Complete Beginner's | 2023
Python for Beginners (YouTube)
Introduction to Jupyter Notebooks
Set-up, user-guide, and best practices for Jupyter Notebooks.
Object Oriented Programming with Python - Full Course for Beginners
Download Anaconda
Comprehensive Python distribution for data science and machine learning.
Jupyter Lab
Easy-to-use environment that combines Python code, graphics, and text.
PyCharm IDE
Professional Python IDE with advanced features for development.

PyTorch

Mathematics

Introduction to Linear Algebra for Applied Machine Learning with Python
Mathematics for Data Science Basics
MIT OpenCourseWare:
- Highlights of Calculus by Gilbert Strang
- Linear Algebra by Gilbert Strang
- Differential Equations and Linear Algebra
- Introduction to Probability by John Tsitsiklis
- Signals and Systems by Alan V. Oppenheim

EE5438 Applied Deep Learning
