Here's my end-of-year review of things we learned about LLMs in 2024 - we learned a LOT of things https://simonwillison.net/2024/Dec/31/llms-in-2024/
Table of contents:
@iveyline I really hope not. I like LLMs that augment human abilities - that give us new tools. That's one of the reasons I'm unexcited about the idea of "AGI" - that sounds like a human-replacement play to me, which doesn't interest me at all.
Video scraping: extracting JSON data from a 35 second screen capture for less than 1/10th of a cent https://simonwillison.net/2024/Oct/17/video-scraping/
I needed to extract information from a dozen emails in my inbox... so I ran a screen capture tool, clicked through each of them in turn and then got Google's Gemini 1.5 Flash multi-modal LLM to extract (correct, I checked it) JSON data from that 35 second video.
Total cost for 11,018 tokens: $0.00082635
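That per-token price works out exactly if you assume Gemini 1.5 Flash's input pricing of $0.075 per million tokens (the late-2024 list price; treat the number as an assumption, not something stated in the post). A minimal sketch of the arithmetic:

```python
def gemini_cost(tokens: int, price_per_million: float = 0.075) -> float:
    """Estimate the dollar cost of a Gemini 1.5 Flash request.

    price_per_million is an assumed input-token price ($0.075/M,
    the published rate in late 2024) -- check current pricing.
    """
    return tokens * price_per_million / 1_000_000

# The 11,018-token video-scraping run:
print(f"${gemini_cost(11_018):.8f}")  # → $0.00082635
```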
It turns out Google Chrome ships a default, hidden extension that allows code on `*.google.com` access to private APIs, including your current CPU usage
You can test it out by pasting the following into your Chrome DevTools console on any Google page:
chrome.runtime.sendMessage(
  "nkeimhogjdpnpccoofpliimaahmaaome",
  { method: "cpu.getInfo" },
  (response) => {
    console.log(JSON.stringify(response, null, 2));
  },
);
More notes here: https://simonwillison.net/2024/Jul/9/hangout_servicesthunkjs/
Several of the major social media platforms - Instagram, TikTok, LinkedIn, Twitter - have effectively declared war on linking to things and I absolutely hate it
"Link in my bio" / "Link in thread" / "Link in first comment"... or increasingly no link at all, just an unsourced screenshot of a page
@feditips @Snowshadow @mhoye I think it's this bug here https://github.com/mastodon/mastodon/issues/24676
@feditips @Snowshadow @mhoye that's the bug: it's definitely a thread!
You can see that it's a thread if you follow this link to the second post in that thread: https://mastodon.social/@mhoye/111336017090790537
OK, I have a somewhat baffling (to me) Mastodon question. How do I link to a thread?
I want to link to a fantastic thread by @mhoye - but if I link to the first post in that thread - https://mastodon.social/@mhoye/111335603309582734 - I get a page with a single post on it and other people's replies, with no indication it's part of a larger thread from the same author
Am I missing something here?
Question for people who understand how US non-profits work - how normal is it to spend $247,000 on "CEO outsourced services" as an independent contractor?
Just poking around in https://projects.propublica.org/nonprofits/organizations/133444882
We accidentally invented computers that can lie to us and we can't figure out how to make them stop
(If you don't think it's possible for a computer to deliberately lie, take a look at "sycophancy" and "sandbagging" in the field of large language models! https://simonwillison.net/2023/Apr/5/sycophancy-sandbagging/ )
Here's my latest weirdly specific GPT-4 enhanced project: we wanted to measure the temperature of a microwave Raku kiln (yes, that's a thing - talk to @natbat about it) over time without tediously watching the thermometer for hours... so instead we recorded a video of the thermometer then used ffmpeg and Google Cloud Vision to OCR readings from it into a database https://til.simonwillison.net/googlecloud/video-frame-ocr
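The frame-sampling half of that pipeline is easy to sketch. This is a hypothetical reconstruction, not the project's exact code: the filenames, the one-frame-every-five-seconds rate, and the output pattern are all illustrative assumptions, with the Cloud Vision OCR step left as a comment.

```python
import subprocess

def frame_extraction_cmd(video: str, out_dir: str, fps: float = 0.2) -> list[str]:
    """Build an ffmpeg command that saves one frame every 1/fps seconds
    (one every 5 seconds by default) as numbered JPEGs."""
    return [
        "ffmpeg", "-i", video,
        "-vf", f"fps={fps}",           # ffmpeg's fps filter downsamples the frame rate
        f"{out_dir}/frame_%05d.jpg",   # frame_00001.jpg, frame_00002.jpg, ...
    ]

cmd = frame_extraction_cmd("kiln.mp4", "frames")
# subprocess.run(cmd, check=True)
# ...then send each JPEG to the Cloud Vision text-detection API
# and parse the thermometer reading out of the OCR result.
```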
I expect GPT-4 will have a LOT of applications in web scraping
The increased 32,000-token limit will be large enough to send it the full DOM of most pages, serialized to HTML - then ask questions to extract data
Or... take a screenshot and use the GPT-4 image input mode to ask questions about the visually rendered page instead!
Might need to dust off all of those old semantic web dreams, because the world's information is rapidly becoming fully machine readable
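The DOM-scraping idea above can be sketched as a prompt-assembly step. Everything here is an assumption for illustration: the instruction wording, the JSON-output request, and the crude character budget standing in for a real token count. The actual model call is left as a comment rather than invented.

```python
def build_scrape_prompt(html: str, question: str, max_chars: int = 100_000) -> list[dict]:
    """Assemble chat messages asking an LLM to extract data from raw HTML.

    max_chars is a rough stand-in for a real token budget -- a proper
    implementation would count tokens against the 32k context window.
    """
    return [
        {"role": "system",
         "content": "Answer using only the supplied HTML. Reply with JSON."},
        {"role": "user",
         "content": f"{question}\n\n{html[:max_chars]}"},
    ]

messages = build_scrape_prompt("<html>...</html>", "List every product name and price")
# Then pass `messages` to your chat-completion client of choice,
# e.g. a GPT-4 32k model, and parse the JSON it returns.
```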
@matrix I think both!
@matrix Trying that now and the results are pretty extraordinary, it seems to be able to write Datasette plugins from scratch
Just got access to GPT-4 via ChatGPT and it's doing shockingly (and creepily) well on my test of "Who is X?" - here's its answers for Simon Willison and for Andy Baio
Both of them appear to be entirely accurate - in the past I've seen all sorts of wild hallucinations from this kind of prompt
There are a LOT of screenshots of the current Bing floating around right now where it answers questions with hilariously bad answers. This is NOT the new Bing though: this is Bing's existing version of Google's "featured snippets"
The new Bing is still behind a waitlist for most people. I've attached a screenshot of that taken from this Verge article: https://www.theverge.com/2023/2/7/23587454/microsoft-bing-edge-chatgpt-ai
If you see a screenshot like this one you can dunk on it all you like but it's NOT the new GPT-3 enhanced Bing: this is something Bing has been doing poorly for a long time in its existing form
The best screenshots I've seen of the new Bing chat interface so far are in this Reddit gallery, where the bot genuinely ends up trying to passive aggressively gaslight the user into believing that it's still 2022 https://www.reddit.com/r/bing/comments/110eagl/the_customer_service_of_the_new_bing_chat_is/
(I really hope I can get access to this thing before they fix its personality to not be so weird and rude and argumentative)
Open source developer building tools to help journalists, archivists, librarians and others analyze, explore and publish their data. https://datasette.io and many other #projects.