AbstractPhila's picture

Open to Collab

AbstractPhila PRO

AbstractPhil

AbstractPowered

·

https://civitai.com/user/AbstractPhila

AbstractEyes

AI & ML interests

datasets, research papers, experimentation, vision, classification, text encoders, tokenization, llms, diffusion, distillation, and more.

Recent Activity

updated a model about 6 hours ago

AbstractPhil/geolip-aleph-void

reacted to OzTianlu's post with 🧠 about 21 hours ago

ResNet is Explicit Euler. GPT is Implicit Euler. What Else is Hiding in Plain Sight? Read online: https://datawhalechina.github.io/learning-terrain/ I wrote an open-source monograph on learning dynamics — The Terrain of Learning. Bilingual (Chinese/English), 4 volumes, 12 chapters, 30+ print-grade figures. Completely free (CC BY-NC-SA 4.0). The core argument: gradient descent is not optimization. It's terrain motion. The loss function is a landscape. The gradient is the direction of slope. The optimizer is how you choose each step. Once you see it this way, everything clicks: ResNet = explicit Euler integration on a vector field. The residual branch is the vector field. Each layer takes one Euler step. GPT autoregression = implicit-state Euler iteration. Stable where explicit Euler explodes. That's why transformers handle long-range dependencies. DEQ = the Banach fixed-point theorem in production. The forward pass is root-finding. There are no layers to backprop through. KL divergence = a Bregman divergence on the entropy landscape. Your belief space is curved, not flat. Chain-of-thought reasoning = hidden states flowing along a reasoning field toward an attractor basin. Correct answers have wide basins. The number of reasoning steps is determined by the terrain, not by the problem. Diffusion models = systems flowing downhill along a score vector field, from noise to structure, from high energy to low energy. The book traces one idea across 337 years — from F=ma (Newton, 1687) to H=T+V (Hamilton, 1833) to loss landscape + gradient field (2020s). Hamilton replaced a catalog of forces with one geometric object. This book does the same for deep learning. GitHub: https://github.com/datawhalechina/learning-terrain Discussion: https://github.com/datawhalechina/learning-terrain/discussions/2 Convergence is not hope. Convergence is geometry. You see.

repliedto OzTianlu's post about 21 hours ago

ResNet is Explicit Euler. GPT is Implicit Euler. What Else is Hiding in Plain Sight? Read online: https://datawhalechina.github.io/learning-terrain/ I wrote an open-source monograph on learning dynamics — The Terrain of Learning. Bilingual (Chinese/English), 4 volumes, 12 chapters, 30+ print-grade figures. Completely free (CC BY-NC-SA 4.0). The core argument: gradient descent is not optimization. It's terrain motion. The loss function is a landscape. The gradient is the direction of slope. The optimizer is how you choose each step. Once you see it this way, everything clicks: ResNet = explicit Euler integration on a vector field. The residual branch is the vector field. Each layer takes one Euler step. GPT autoregression = implicit-state Euler iteration. Stable where explicit Euler explodes. That's why transformers handle long-range dependencies. DEQ = the Banach fixed-point theorem in production. The forward pass is root-finding. There are no layers to backprop through. KL divergence = a Bregman divergence on the entropy landscape. Your belief space is curved, not flat. Chain-of-thought reasoning = hidden states flowing along a reasoning field toward an attractor basin. Correct answers have wide basins. The number of reasoning steps is determined by the terrain, not by the problem. Diffusion models = systems flowing downhill along a score vector field, from noise to structure, from high energy to low energy. The book traces one idea across 337 years — from F=ma (Newton, 1687) to H=T+V (Hamilton, 1833) to loss landscape + gradient field (2020s). Hamilton replaced a catalog of forces with one geometric object. This book does the same for deep learning. GitHub: https://github.com/datawhalechina/learning-terrain Discussion: https://github.com/datawhalechina/learning-terrain/discussions/2 Convergence is not hope. Convergence is geometry. You see.

View all activity

Organizations

AbstractPhil 's models 190

AbstractPhil/geolip-aleph-void

Feature Extraction • Updated about 6 hours ago

AbstractPhil/geolip-sdxl-aleph

Text-to-Image • Updated 6 days ago • • 1

AbstractPhil/geolip-hypersphere-experiments

Updated 11 days ago • 1

AbstractPhil/geolip-svae-transformer

Feature Extraction • Updated 14 days ago

AbstractPhil/sd15-flow-lune-flux

Updated 20 days ago

AbstractPhil/SDXL-Simulacrum-V3-1

0.2B • Updated 20 days ago

AbstractPhil/geolip-SVAE

Updated 26 days ago • 2

AbstractPhil/sd15-flow-lune-json-geolip-vit

Updated 27 days ago

AbstractPhil/sd15-flow-lune-json-geolip-prompt

Updated 27 days ago

AbstractPhil/qwen3.5-0.8b-task_1-lora-v2

Updated 27 days ago

AbstractPhil/sd15-flow-lune-json-prompt

Updated 27 days ago

AbstractPhil/sd15-flow-lune-json-vit

Updated 27 days ago

AbstractPhil/qwen-json-finetunes-dump

Updated 28 days ago

AbstractPhil/qwen3.5-0.8b-task_1-lora-v2-stage1

Updated 28 days ago

AbstractPhil/qwen3.5-0.8b-task_1-lora

Text Generation • Updated about 1 month ago • 34

AbstractPhil/qwen3.5-0.8b-task_3-lora

Text Generation • Updated about 1 month ago • 1

AbstractPhil/qwen3.5-0.8b-task_2-lora

Text Generation • Updated about 1 month ago • 2

AbstractPhil/geolip-svae-text

AbstractPhil/geolip-svae-implicit-solver-experiments

AbstractPhil/geolip-svae-h2-64

11M • Updated Apr 25 • 6

AbstractPhil/geolip-svae-ablations

AbstractPhil/geolip-svae-batteries

Other • Updated Apr 20

AbstractPhil/geolip-cvae-proto

AbstractPhil/geolip-svd-encoder-sweeps

AbstractPhil/geolip-spectral-cell

AbstractPhil/geolip-spectral-vit

AbstractPhil/geolip-conduit-experiments

AbstractPhil/geolip-svd-reconstitution

AbstractPhil/svae-freckles-4096

AbstractPhil/svae-freckles-256