Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Open to Collab
19
6
24
AbstractPhila
PRO
AbstractPhil
Follow
tyro12's profile picture
webxos's profile picture
Duskfallcrew's profile picture
85 followers
·
124 following
https://civitai.com/user/AbstractPhila
AbstractEyes
AI & ML interests
datasets, research papers, experimentation, vision, classification, text encoders, tokenization, llms, diffusion, distillation, and more.
Recent Activity
updated
a model
about 6 hours ago
AbstractPhil/geolip-aleph-void
reacted
to
OzTianlu
's
post
with 🧠
about 21 hours ago
ResNet is Explicit Euler. GPT is Implicit Euler. What Else is Hiding in Plain Sight? Read online: https://datawhalechina.github.io/learning-terrain/ I wrote an open-source monograph on learning dynamics — The Terrain of Learning. Bilingual (Chinese/English), 4 volumes, 12 chapters, 30+ print-grade figures. Completely free (CC BY-NC-SA 4.0). The core argument: gradient descent is not optimization. It's terrain motion. The loss function is a landscape. The gradient is the direction of slope. The optimizer is how you choose each step. Once you see it this way, everything clicks: ResNet = explicit Euler integration on a vector field. The residual branch is the vector field. Each layer takes one Euler step. GPT autoregression = implicit-state Euler iteration. Stable where explicit Euler explodes. That's why transformers handle long-range dependencies. DEQ = the Banach fixed-point theorem in production. The forward pass is root-finding. There are no layers to backprop through. KL divergence = a Bregman divergence on the entropy landscape. Your belief space is curved, not flat. Chain-of-thought reasoning = hidden states flowing along a reasoning field toward an attractor basin. Correct answers have wide basins. The number of reasoning steps is determined by the terrain, not by the problem. Diffusion models = systems flowing downhill along a score vector field, from noise to structure, from high energy to low energy. The book traces one idea across 337 years — from F=ma (Newton, 1687) to H=T+V (Hamilton, 1833) to loss landscape + gradient field (2020s). Hamilton replaced a catalog of forces with one geometric object. This book does the same for deep learning. GitHub: https://github.com/datawhalechina/learning-terrain Discussion: https://github.com/datawhalechina/learning-terrain/discussions/2 Convergence is not hope. Convergence is geometry. You see.
replied
to
OzTianlu
's
post
about 21 hours ago
ResNet is Explicit Euler. GPT is Implicit Euler. What Else is Hiding in Plain Sight? Read online: https://datawhalechina.github.io/learning-terrain/ I wrote an open-source monograph on learning dynamics — The Terrain of Learning. Bilingual (Chinese/English), 4 volumes, 12 chapters, 30+ print-grade figures. Completely free (CC BY-NC-SA 4.0). The core argument: gradient descent is not optimization. It's terrain motion. The loss function is a landscape. The gradient is the direction of slope. The optimizer is how you choose each step. Once you see it this way, everything clicks: ResNet = explicit Euler integration on a vector field. The residual branch is the vector field. Each layer takes one Euler step. GPT autoregression = implicit-state Euler iteration. Stable where explicit Euler explodes. That's why transformers handle long-range dependencies. DEQ = the Banach fixed-point theorem in production. The forward pass is root-finding. There are no layers to backprop through. KL divergence = a Bregman divergence on the entropy landscape. Your belief space is curved, not flat. Chain-of-thought reasoning = hidden states flowing along a reasoning field toward an attractor basin. Correct answers have wide basins. The number of reasoning steps is determined by the terrain, not by the problem. Diffusion models = systems flowing downhill along a score vector field, from noise to structure, from high energy to low energy. The book traces one idea across 337 years — from F=ma (Newton, 1687) to H=T+V (Hamilton, 1833) to loss landscape + gradient field (2020s). Hamilton replaced a catalog of forces with one geometric object. This book does the same for deep learning. GitHub: https://github.com/datawhalechina/learning-terrain Discussion: https://github.com/datawhalechina/learning-terrain/discussions/2 Convergence is not hope. Convergence is geometry. You see.
View all activity
Organizations
AbstractPhil
's models
190
Sort: Recently updated
AbstractPhil/geolip-aleph-void
Feature Extraction
•
Updated
about 6 hours ago
AbstractPhil/geolip-sdxl-aleph
Text-to-Image
•
Updated
6 days ago
•
•
1
AbstractPhil/geolip-hypersphere-experiments
Updated
11 days ago
•
1
AbstractPhil/geolip-svae-transformer
Feature Extraction
•
Updated
14 days ago
AbstractPhil/sd15-flow-lune-flux
Updated
20 days ago
AbstractPhil/SDXL-Simulacrum-V3-1
0.2B
•
Updated
20 days ago
AbstractPhil/geolip-SVAE
Updated
26 days ago
•
2
AbstractPhil/sd15-flow-lune-json-geolip-vit
Updated
27 days ago
AbstractPhil/sd15-flow-lune-json-geolip-prompt
Updated
27 days ago
AbstractPhil/qwen3.5-0.8b-task_1-lora-v2
Updated
27 days ago
AbstractPhil/sd15-flow-lune-json-prompt
Updated
27 days ago
AbstractPhil/sd15-flow-lune-json-vit
Updated
27 days ago
AbstractPhil/qwen-json-finetunes-dump
Updated
28 days ago
AbstractPhil/qwen3.5-0.8b-task_1-lora-v2-stage1
Updated
28 days ago
AbstractPhil/qwen3.5-0.8b-task_1-lora
Text Generation
•
Updated
about 1 month ago
•
34
AbstractPhil/qwen3.5-0.8b-task_3-lora
Text Generation
•
Updated
about 1 month ago
•
1
AbstractPhil/qwen3.5-0.8b-task_2-lora
Text Generation
•
Updated
about 1 month ago
•
2
AbstractPhil/geolip-svae-text
Updated
May 8
AbstractPhil/geolip-svae-implicit-solver-experiments
Updated
Apr 25
AbstractPhil/geolip-svae-h2-64
11M
•
Updated
Apr 25
•
6
AbstractPhil/geolip-svae-ablations
Updated
Apr 24
AbstractPhil/geolip-svae-batteries
Other
•
Updated
Apr 20
AbstractPhil/geolip-cvae-proto
Updated
Apr 20
AbstractPhil/geolip-svd-encoder-sweeps
Updated
Apr 18
AbstractPhil/geolip-spectral-cell
Updated
Apr 16
AbstractPhil/geolip-spectral-vit
Updated
Apr 15
AbstractPhil/geolip-conduit-experiments
Updated
Apr 11
AbstractPhil/geolip-svd-reconstitution
Updated
Apr 10
AbstractPhil/svae-freckles-4096
Updated
Apr 9
AbstractPhil/svae-freckles-256
Updated
Apr 9
Previous
1
2
3
...
7
Next