Skip to content

Latest commit

 

History

History
260 lines (228 loc) · 9.32 KB

2025_01_06.md

File metadata and controls

260 lines (228 loc) · 9.32 KB

Oxide and Friends: January 6th, 2025

Predictions 2025

We've been hosting a live show weekly on Mondays at 5p for about an hour, and recording them all; here is the recording.

In addition to Bryan Cantrill and Adam Leventhal, our guests were Simon Willison, Mike Cafarella, Steve Tuck, and Steve Klabnik.

Futurist 1 year 3 year 6 year
Steve Congestion pricing in NYC will be an unambiguous success. It will still exist and sentiment will be positive. Some government contracts will require a memory safety roadmap included in the procurement process. AI will no longer be the hottest sector of investment in Silicon Valley, in the same way that AI replaced Web3 as such.
Steve Tuck AMD acquires a software startup for over $1B to (try to) solve their software problems Fox Rental Car is bigger than Hertz Rental Intel is out of the Foundry business
Adam Web3 is back (baby!) both as a term and an investment focus for VC. Crypto hype is back in force. Chips crisis--brought on by geopolitics, tarriffs, natural disaster, etc. leading to shortages and a small set of companies who can get the choicest processors (and processes) (Oxide is among them). AI is mostly done on even more specialized hardware that doesn't resemble GPUs (e.g. not CUDA). This enables a diversity of vendors.
Bryan 2025 is the Year of AI Efficiency: era of 10X YoY pretraining cluster growth is over, with cluster flattening being treated with various euphemisms ("pause", "correction", "consolidation", etc.) Emphasis swings to training with less resources (DeepSeek) and test-time compute.
Blackwell struggles: thermal issues, yield issues, reliability issues, pricepoint, changing market conditions. H100/H200 thrive.
Intel's CEO search is an unmitigated disaster, with warring constiuencies unable to agree; the year ends with the co-CEOs still in place
Cybertruck is no longer being manufactured
After much tumult, Intel Foundry Services has been spun out of Intel -- and does not bear the Intel name. It has been purchased for $1 by a deep-pocketed maverick.
A new product/service has used LLMs to completely revolutionize podcast search/listening
Secondary education has been revolutionized by LLMs/AI
Postsecondary degrees in CS (and related) are in freefall: below 70k/year and dropping (~2015 levels)
Simon (more on my blog) Agents will fail to happen, again. Lots of people will try to build AI assistants capable of acting autonomously on your behalf. Gullibity and reliability will render them useless, and a lot of investors will lose their shirts.
... research assistants and code assistants (code/debug-in-a-loop) will continue to work great though.
Someone will win a Pulitzer for investigative reporting where LLM tools were part of their process.
We'll get some privacy laws with teeth that govern things like training on input data from end-users - not federal, but may state or international.
Utopian: we'll get some really amazing art. Oscar winning films will use generative AI tools and it won't even be surprising or worth discussing. (Update: turns out Everything Everywhere All at Once used Runway ML already)
Dystopian: if AGI that can do almost any job a human can do emerges, the economic and civic impact will be really unpleasant.

Join us live to drop your predictions and/or make a PR adding your predictions to this table. We'll accept predictions through January 2025.

Futurist 1 year 3 year 6 year
Fable Tales bsky TSMC's arizona fab will be oversubscribed by end of 2025. We will have conclusively decided that current approaches to AI (i.e. LLMS) will not produce AGI or ASI. CPU/GPU on a chip ala apple silicon is the norm in all consumer computing, and has somewhat high enterprise/server penetration. Many applications that today do not take advantage of GPU availability are offloading work to them.
Joe Thompson One of the major AI companies will be forced to shut down either due to running out of investment cash or due to a data breach of such severity they're basically forced to close up shop. The tension between open source and "community source" will finally break out into the open with an OSI-style organization formed to classify and endorse licenses that allow permissive access to source but restrict its use in various ways. There will be at least one single-day loss of a trillion dollars in market cap of a major publicly-traded tech company.

Kelsey Hightower will be acclaimed CNCF God-Emperor For Life.

I will finally understand how git works.

Kris Mojo hits top 25 in TIOBE Index. Handheld gaming dominates, Switch 2 and Steam Deck 2 lead the shift from flagship consoles to flagship portables. Swift-5-to-6 considered a Python-2-to-3 failure writ small. Waymo’s success balances out Google’s ongoing decline. Household cleaning robots beyond vacuums are commonplace.
Ellie (bsky: https://bsky.app/profile/ellie.fm) We crack the Tbps barrier in the enterprise for a single ethernet link (currently 800Gbps)

Hardware for CXL will enter GA, software won't be there to take advantage quite yet

RISC-V enters the datacenter via Tenstorrent in the form of "AI acceleration cards". Nobody figures out how the hell to use them for about 2-3 years though.

25Gbps over CAT7/CAT8 happens, meaning that we'll see more consumer hardware supporting it as a step up from 10GbE.

CXL is used in the building of a HPC cluster, supplementing InfiniBand / Ethernet. Software support starts picking up, people start figuring out how to use it

Someone outside of AMD/NVIDIA/AWS/Azure finally figures out how to properly make use of DPUs, and we see proper software support & adoption. Could be in the form of CNIs (for Kubernetes), whatnot.

We see a rise in more exotic cooling solutions in the datacenter (mineral oil immersion, water-based, maybe even liquid nitrogen?) due to ever-increasing power requirements from GPUs + x86 servers

OCP racks will enter GA, meaning blind-mated 48VDC power backplanes for all, albeit in a vastly different form factor from regular 42U racks.

K8s will continue to remain supreme and eat up more market share from VMware's remains

It becomes seamless to transition off of x86 -> ARM, thanks to a new compatibility layer that {Amazon, Ampere, NVIDIA, etc} developed, very similar to Rosetta 2 (Apple).

GPUs are entirely racked separately from compute blades in datacenters, connected over CXL fabric (rack-to-rack).

ARM market-share in the datacenter increases dramatically, thanks to better thermals + performance compared to x86, as well as the aforementioned compatibility layer.

Kevin Webb Someone releases a non-chat, task-specific deep learning application that goes viral/breaks through the noise on AGI/"One Model to Rule Them All" narrative pushed by OpenAI et al. Mainstream journalism discovers that most folks working in the industry are building practical task-specific deep learning applications, and that chat-based AGI is a marketing ploy, not a technology.

Inference costs for GPT-oÆ-12, and lobbying for deregulation of nuclear energy bankrupts OpenAI despite attempts to charge 100x premium on GPU compute.

Nvidia solves driver installation challenges stymying AI industry. Over 100 AI infrastructure startups fold as their business model collapses.
Ian Grunert Ban on new sales of tp-link router hardware in the USA

OpenAI pricing or usage limit changes to prevent losing money on power users of their current flat monthly pricing schemas
Apple Xserve line returns / Apple starts selling server hardware again Waymo overtakes Uber in rider miles per day

If we got something wrong or missed something, please file a PR! Our next show will likely be on Monday at 5p Pacific Time on our Discord server; stay tuned to our Mastodon feeds for details, or subscribe to this calendar. We'd love to have you join us, as we always love to hear from new speakers!