Files
davideisinger.com/static/archive/www-robinsloan-com-sbm0vr.txt
2024-01-17 12:05:58 -05:00

290 lines
14 KiB
Plaintext
Raw Permalink Blame History

This file contains invisible Unicode characters
This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
[1]Home [2]About [3]Moonbound From: Robin Sloan
To: the lab
Sent: March 2023
Phase change
An extremely close-up photograph of a snowflake, looking almost architectural.
[4]Snowflake, Wilson Bentley, ca. 1910
Earlier this week, in [5]my main newsletter, I praised a new project from Matt
Webb. Here, I want to come at it from a different angle.
Briefly: Matt has built the [6]Braggoscope, a fun and useful application for
exploring the archives of the beloved BBC radio show In Our Time, hosted by the
inimitable Melvyn Bragg.
In Our Time only provides HTML pages for each episodetheres no structured
data, no sense of “episode X is connected to episode Y because of shared
feature Z”.
As Matt explains [7]in his write-up, he fed the plain-language content of each
episode page into the GPT-3 API, cleverly prompting it to extract basic
metadata, along with a few subtler propertiesincluding a Dewey
Decimal number!?
(Explaining how and why a person might prompt a language model is beyond the
scope of this newsletter; you can [8]read up about it here.)
Heres [9]a bit of Matts prompt:
Extract the description and a list of guests from the supplied episode notes from a podcast.
Also provide a Dewey Decimal Classification code and label for the description
Return valid JSON conforming to the following Typescript type definition:
{
"description": string,
"guests": {"name": string, "affiliation": string | null}[]
"dewey_decimal": {"code": string, "label": string},
}
Episode synopsis (Markdown):
{notes}
Valid JSON:
Important to say: it doesnt work perfectly. Matt reports that GPT-3 doesnt
always return valid JSON, and if you browse the Braggoscope, youll find plenty
of questionable filing choices.
And yet! What a technique. (Matt credits Noah Brier for [10]the insight.)
It fits into a pattern Ive noticed: while the buzzy application of the
GPT-alikes is chat, the real workhorse might be text transformation.
As Matt writes:
Sure Google is all-in on AI in products, announcing chatbots to compete
with ChatGPT, and synthesised text in the search engine. BUT.
Using GPT-3 as a function call.
Using GPT-3 as a universal coupling.
It brings a lot within reach.
I think the magnitude of this shiftI would say its on the order of the
web from the mid 90s? There was a radical simplification and democratisa
tion of software (architecture, development, deployment, use) that took
decades to really unfold.
For me, 2022 and 2023 have presented two thick strands of inquiry: the web and
AI, AI and the web. This is evidenced by the structure of these lab
newsletters, which have tended towards birfucation.
Matts thinking is interesting to me because it brings the strands together.
One of the pleasures of HTTP (the original version) is that its almost plain
language, though a very simple kind. You can execute an HTTP request “by hand”:
telnet www.google.com 80 followed by GET /.
Language models as universal couplers begin to suggest protocols that really
are plain language. What if the protocol of the GPT-alikes is just a bare TCP
socket carrying free-form requests and instructions? What if the RSS feed of
the future is simply my language model replying to yours when it asks, “Whats
up with Robin lately?”
I like this because I hate it; because its weird, and makes me
feel uncomfortable.
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
I think its really challenging to find the appropriate stance towards
this stuff.
On one hand, I find critical deflation, of the kind youll hear from Ted
Chiang, Simon Willison, and Claire Leibowicz in [11]this recent episode of KQED
Forum, appropriate and useful. The hype is so powerful that any corrective
is welcome.
However! On the critical side, the evaluation of whats before us isnt
sufficient; not even close. If we demand humility from AI engineers, then we
ought to match it with imagination.
An important fact about these language modelsone that sets them apart from,
say, the personal computer, or the iPhoneis that their capabilities have
been surprising, often confounding, even to their creators.
AI at this moment feels like a mash-up of programming and biology. The program
ming part is obvious; the biology part becomes apparent when you see [12]AI
engineers probing their own creations the way you might probe a mouse in a lab.
The simple fact is: even at the highest levels of theory and practice, no one
knows how these language models are doing what theyre doing.
Over the past few years, in the evolution from GPT-2-alikes to GPT-3-alikes and
beyond, its become clear that the “returns to scale”—both in terms of (1) a
models size and (2) the scope of its training dataare exponential and
nonlinear. Simply adding more works better, and works weirder, than it should.
The nonlinearity is, to me, the most interesting part. As these models have
grown, they have undergone widely observed “phase changes” in capability, just
as sudden and surprising as water frozen or cream whipped.
At the moment, my deepest engagement with a language model is in a channel on a
Discord server, where our gallant host has set up a ChatGPT-powered bot and
laced a simple personality into its prompt. The sociability has been a
revelationmultiplayer ChatGPT is much, MUCH more fun than single player
and, of course, the conversation tends towards goading the bot, testing its
boundaries, luring it into absurdities.
The bot writes poems, sure, and song lyrics, and movie scenes.
The bot also produces ASCII art, and SVG code, and [13]PICO-8 programs, though
they dont always run.
I find myself deeply ambivalent, in the original sense of: thinking many things
at once. Im very aware of the bots limitations, but/and I find myself stunned
by its fluency, its range.
Listen: you can be a skeptic. In some ways, I am! But these phase changes have
happened, and that probably means they will keep happening, and no one knows
(the AI engineers least of all) what might suddenly become possible.
As ever, [14]Jack Clark is my guide. Hes a journalist turned AI practioner,
involved in policy and planning at the highest levels, first at OpenAI, now at
Anthropic. And if hes no longer a disinterested observer, he remains deeply
grounded and moral, which makes me trust him when he says, with confidence:
this is the biggest thing going, and we had all better brace for weird
times ahead.
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
What does that mean, to brace for it?
Ive found it helpful, these past few years, to frame my anxieties and dissatis
factions as questions. For example, fed up with the state of social media, [15]
I asked: what do I want from the internet, anyway?
It turns out I had an answer to that question.
Where the GPT-alikes are concerned, a question thats emerging for me is:
What could I do with a universal functiona tool for turning just about any X
into just about any Y with plain language instructions?
I dont pose that question with any sense of wide-eyed expectation; a reason
able answer might be, nothing much. Not everything in the world depends on the
transformation of symbols. But I think that IS the question, and I think it
takes some legitimate work, some strenuous imagination, to push yourself to
believe it really will be “just about any X” into “just about any Y”.
I help operate [16]a small olive oil company, and I have spent a bit of time
lately considering this question in the context of our business. What might a
GPT-alike do for us? What might an even more capable system do?
My answer, so far, is indeed: nothing much! Its a physical business, after
all, mainly concerned with moving and transforming matter. The “obvious” appli
cation is customer support, which I handle myself, and which I am unwilling to
cede to a computer or, indeed, anyone who isnt me. The specific quality and
character of our support is important.
(As an aside: every customer support request I receive is a miniature puzzle,
usually requiring deduction across several different systems. Many of these
puzzles are challenging even to the general intelligence that is me; if it
comes to pass that a GPT-alike can handle them without breaking a sweat, I will
be very, very impressed.)
(Of course, its not going to happen like that, is it? Long before GPT-alikes
can solve the same problems Robin can, using the tools Robin has, the problems
themselves will change to meet the GPT-alikes halfway. The systems will all
learn to “speak GPT”, in some sense.)
The simple act of asking and answering the question was clarifying and calming.
It plucked AI out of the realm of abstract dread and plunked it down on
the workbench.
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Jack Clark includes, in all of his AI newsletters, a piece of original
micro-fiction. One of them, [17]sent in December, has stayed with me. Ill
reproduce it here in full:
Reality Authentication
[The internet, 2034]
“To login, spit into the bio-API”
I took a sip of water and swirled it around my mouth a bit, then hawked
some spit into the little cup on my desk, put its lid on, then flipped over
the receptacle and plugged it into the bio-API system.
“Authenticatingauthentication successful, human-user identified. Enjoy
your time on the application!”
I spent a couple of hours logged-on, doing a mixture of work and pleasure.
I was part of an all-human gaming league called the No-Centaurs; we came
second in a mini tournament. I also talked to my therapist sans his
augment, and I sent a few emails over the BioNet protocol.
When I logged out, I went back to the regular internet. Since the AI models
had got minituarized and proliferated a decade ago, the internet had
radically changed. For one thing, it was so much faster now. It was also
dangerous in ways it hadnt been before - Attention Harvesters were every
where and the only reason I was confident in my browsing was Id paid for a
few protection programs.
I think “brace for it” might mean imagining human-only spaces, online and off.
We might be headed, paradoxically, for a golden age of “get that robot out of
my face”.
In the extreme case, if AI doesnt wreck the world, language models could
certainly wreck the internet, like Jacks Attention Harvesters above. Maybe
well look back at the Web Parenthesis, 1990-2030. It was weird and fun, though
no one in the future will quite understand the appeal.
We are living and thinking together in an interesting time. My recommendation
is to avoid chasing the ball of AI around the field, always a step behind.
Instead, set your stance a little wider and form a question that actually
matters to you.
It might be as simple as: is this kind of capability, extrapolated forward,
useful to me and my work? If so, how?
It might be as wacky as: what kind of protocol could I build around plain
language, the totally sci-fi vision of computers just TALKING to each other?
It might even be my original question, or a version of it: what do I want from
the internet, anyway?
From Oakland,
Robin
March 2023, Oakland
I'm [18]Robin Sloan, a fiction writer. You can sign up for my lab newsletter:
[19][ ] [20][Subscribe]
This website doesnt collect any information about you or your reading.
It aspires to the speed and privacy of the printed page.
Dont miss [21]the colophon. Hony soyt qui mal pence
References:
[1] https://www.robinsloan.com/
[2] https://www.robinsloan.com/about/
[3] https://www.robinsloan.com/moonbound/
[4] https://publicdomainreview.org/essay/the-snowflake-man-of-vermont?utm_source=Robin_Sloan_sent_me
[5] https://www.robinsloan.com/newsletters/ring-got-good/?utm_source=Robin_Sloan_sent_me
[6] https://genmon.github.io/braggoscope/?utm_source=Robin_Sloan_sent_me
[7] https://interconnected.org/home/2023/02/07/braggoscope?utm_source=Robin_Sloan_sent_me
[8] https://help.openai.com/en/articles/6654000-best-practices-for-prompt-engineering-with-openai-api?utm_source=Robin_Sloan_sent_me
[9] https://news.ycombinator.com/item?id=35073824&utm_source=Robin_Sloan_sent_me
[10] https://brxnd.substack.com/p/the-prompt-to-rule-all-prompts-brxnd?utm_source=Robin_Sloan_sent_me
[11] https://www.kqed.org/forum/2010101892368/how-to-wrap-our-heads-around-these-new-shockingly-fluent-chatbots?utm_source=Robin_Sloan_sent_me
[12] https://www.anthropic.com/index/toy-models-of-superposition-2?utm_source=Robin_Sloan_sent_me
[13] https://www.lexaloffle.com/pico-8.php?utm_source=Robin_Sloan_sent_me
[14] https://importai.substack.com/?utm_source=Robin_Sloan_sent_me
[15] https://www.robinsloan.com/lab/specifying-spring-83/
[16] https://fat.gold/?utm_source=Robin_Sloan_sent_me
[17] https://us13.campaign-archive.com/?u=67bd06787e84d73db24fb0aa5&&id=a03ebcd500&utm_source=Robin_Sloan_sent_me
[18] https://www.robinsloan.com/about?utm_source=Robin_Sloan_sent_me
[21] https://www.robinsloan.com/colophon/