Media

From Kumarpedia, the free encyclopedia

This article covers lectures and educational media by Aneesh Kumar. For technical projects, see Projects.

AI Club Lectures

5 slideshows

Can Machines Think?

6 slides

A lecture on machine understanding through John Searle's Chinese Room, with attention to syntax, semantics, language models, and apparent understanding.

Classical AI

3 slides

A lecture on symbolic AI, rational agents, state representations, actions, goals, rewards, and rule-based reasoning, illustrated through game-playing agents.

Platonic Representation Hypothesis

6 slides

A lecture on convergent representations in AI systems, connecting Plato's Theory of Forms, world modeling, latent spaces, and objective structure.

AA-Omniscience

9 slides

A paper presentation on factual recall and knowledge reliability in language models, covering AA-Omniscience, grading methods, results, and benchmark limitations.

Chain-of-Thought

6 slides

A research presentation on chain-of-thought prompting, reasoning traces, model mechanisms, and variants including self-consistency, tree-of-thoughts, and reinforcement learning.

AI Architecture Video Series

4 videos
Establishing the Modern AI Paradigm (2003 & 2014) thumbnailWatch on YT

Episode 4 covers neural language modeling and sequence-to-sequence learning, moving from Bengio et al.'s 2003 paper "A Neural Probabilistic Language Model" to Sutskever et al.'s 2014 paper "Sequence to Sequence Learning with Neural Networks." Episodes 1-3 covered representation and memory: PDP showed how concepts can be learned as distributed representations, while LSTMs showed how these representations can persist through time. Bengio et al.'s work applies these ideas to language through neural next-token prediction, helping establish a central paradigm in modern AI. Sutskever et al.'s seq2seq model extends this paradigm to more complex sequence tasks such as translation and summarization.

Long Short-Term Memory (1997) thumbnailWatch on YT

Episode 3 covers the 1997 paper "Long Short-Term Memory" by Sepp Hochreiter and Jurgen Schmidhuber. The episode follows PDP because it addresses a limitation of distributed representations alone: learned representations are more useful when they can be preserved and related over time. LSTMs introduce a mechanism for maintaining and updating information across sequences.

Parallel Distributed Processing (1986) thumbnailWatch on YT

Episode 2 covers the PDP research group's 1986 work "Parallel Distributed Processing: Explorations in the Microstructure of Cognition." Following Hopfield, PDP represents a major chronological and conceptual development in AI architecture. It introduces distributed representations, which became an important conceptual foundation for later developments in deep learning.

Hopfield Networks (1982) thumbnailWatch on YT

Episode 1 covers John Hopfield's 1982 paper "Neural Networks and Physical Systems with Emergent Collective Computational Abilities." The series begins with Hopfield, rather than earlier neural models such as the perceptron, because its focus is AI architecture rather than learning algorithms. Hopfield presents a rigorous account of neural networks as dynamical systems, framing neural computation as an emergent property of a network's structure and dynamics.