PauseAI•16mo ago

Building the Pause button: researching how to pause effectively.

@Remmelt asked me to submit a research project for the AI Safety Camp (AISC). There is one research question that I feel like we should have a very good, robust answer to: How do we build the Pause button? What do we need in order to effectively halt frontier AI research?

Of course, we already have quite a few thoughts on this, but there are also open questions that we need answers to.

So @johandekock and I have submitted a research plan for AISC.

In this thread / project, we'll try to keep everyone up to date!

## Current status

- @Farhan will probably be team lead; @dogwglasses and @Joep Meindertsma are team members. We're still interviewing other members using the AISC Airtable.
- We're updating the research plan and the Building the Pause Button page as we go along.
- We'll formally start in January.

## How you can help

- Share research about compute governance. Some research has been published by Gov.AI and ICFG.
- Come up with novel ideas on verification mechanisms to halt AI training runs. Here's a paper by Akash Wasil.
- Share flaws in existing approaches.
- Join the team!

spotty-amber•9/28/24, 9:41 PM

international cooperation is key here imo

spotty-amber•9/28/24, 9:42 PM

also important to reach out to folk who disagree and find common ground

dead-brown•9/29/24, 1:29 AM

I love this idea

worthy-azure•9/29/24, 2:06 AM

BTW, does anyone know anyone who could help lead this compute research project? I'm really excited about this project as an AI Safety Camp organiser, but Joep is already overstretched.

sad-indigo•10/1/24, 3:35 AM

not sure if the project lead has been all figured out, but would love to help in any capacity needed!

hurt-tomatoOP•10/17/24, 12:46 PM

Just met John Khan and Annelene Schulz, who are interning at @Otto's Existential Risk Observatory. They will be researching how we can pause for the long term. In the short term, pausing is relatively simple, due to the scale of AI training runs. But various innovations can make this more difficult in the long term:

- Decentralized training runs
- Quantum computers
- Photonic chips
- Second-hand computers
- Algorithmic advances

hurt-tomatoOP•10/17/24, 9:13 PM

Nice video on this very topic! https://youtu.be/M4DAfzDnJzU?si=dcyvH6UblbBgp7Hb

YouTubeDr Waku

Can we escape Moloch’s trap with a GPU treaty?

In competitive situations, people end up sacrificing common values to get an edge. But this edge only lasts a short time until everyone does the same, so everyone is right back where they started but the common value is gone forever. Moloch is the personification of this “race to the bottom”. When asked why situations that are bad for everyone a...

hurt-tomatoOP•11/1/24, 3:46 PM

Wrote an article: https://pauseai.info/building-the-pause-button

PauseAI

Building the Pause Button

What would an AI Pause look like? How do you continue to actually prevent a superintelligence from being created?

hurt-tomatoOP•11/4/24, 7:26 PM

Project on the AI Safety Camp website: https://www.aisafety.camp/#h.sy274smw8x44

worthy-azure•11/5/24, 5:22 AM

This is a great read btw!

skinny-azure•11/6/24, 12:34 PM

Great read! One thing I noticed: this should probably be FlexHEGs, not FlegHEGs.

hurt-tomatoOP•11/6/24, 12:36 PM

Thanks! will fix

hurt-tomatoOP•11/14/24, 8:29 AM

I'm impressed by some of the AISC applicants! We'll probably have a great team

hurt-tomatoOP•11/14/24, 8:29 AM

And thanks to @dogwglasses for helping out a lot!

spotty-amber•11/14/24, 8:56 AM

Scheduling chats with 5 interesting applicants, will keep the channel updated on how they go!

spotty-amber•11/14/24, 8:56 AM

Excited for the direction of this project

spotty-amber•11/15/24, 5:20 AM

4 out of 5 peeps replied, I got 2 chats Friday (one in under 12 hours!), 1 chat next Monday, 1 chat Tuesday

spotty-amber•11/15/24, 5:20 AM

@Joep Meindertsma Any advice for good info to discuss in the intro call?

spotty-amber•11/15/24, 5:20 AM

I was gonna introduce the project, ask if it sounds interesting, and ask what kind of work or sub-questions they see themselves doing

spotty-amber•11/15/24, 5:21 AM

and if they're leaning towards any other projects (since everyone I'm calling listed this as 2nd choice or other)

spotty-amber•11/15/24, 5:27 AM

make it clear what this project is and is not

spotty-amber•11/15/24, 5:27 AM

(added some scope clarifications in the doc i think are correct based on my understanding)

hurt-tomatoOP•11/15/24, 9:00 AM

That's great!

worthy-azure•11/15/24, 4:13 PM

Cool to see your work here @dogwglasses!

You can also ask for what they see as what’s important to get right for the project. Challenge them to give their own views ^^

Interviews are useful for asking questions that can resolve uncertainties you still have about how to interpret their application.

Also nice to conversationally get to know each other and what the project is about. I like your points about scope and where they want to contribute to team.

spotty-amber•11/16/24, 8:45 AM

Chatted with Gideon. Seems like he was excited about the project. He has some experience working on supply chain research and would be most interested in question 3 (What choke points exist in the AI chip supply chain?)

hurt-tomatoOP•11/18/24, 8:54 AM

@Farhan This is the project page!

hurt-tomatoOP•11/18/24, 3:21 PM

https://www.cnas.org/publications/reports/secure-governable-chips

CNAS

Secure, Governable Chips

Developing strong, pragmatic and principled national security and defense policies.

hurt-tomatoOP•11/18/24, 3:38 PM

@Ananthi Al Ramiah is interested in joining!

instant-harlequin•11/18/24, 4:39 PM

Hi! Thanks Joep. Yes, this project is my 1st choice for AISC. I only got my application in last night, so it may not be on your radar yet. I had an interesting conversation with Joep this morning discussing the critical role that understanding the psychology and constraints of stakeholders plays when trying to create big mindset and policy shifts. Hopefully the white paper that this project creates can be both practically actionable and psychologically resonant for the people it is aimed at.

spotty-amber•11/18/24, 4:56 PM

Chatted with Dominika. She brings a ton of experience including a PhD centered around international military/security/drone warfare and is most interested in the 2nd hand chips research question. She seems genuinely interested in our work and would bring valuable insights and a unique perspective to the team.

hurt-tomatoOP•11/18/24, 7:25 PM

You should also chat with @Ananthi Al Ramiah!

spotty-amber•11/18/24, 11:56 PM

Did the airtable list change recently? There's way more entries than I remember

spotty-amber•11/19/24, 12:21 AM

Spoke with Jialu. Her background is a PhD focused around game theory and market theory. Meeting went well, I think she was more seriously considering the project after I pitched it to her (since it didn't seem to be her first choice initially)

scattered-teal•11/19/24, 3:57 AM

Thanks for adding me to the project @Joep Meindertsma . I am already enjoying the energy here. @dogwglasses Thanks for the very quick action on speaking to interested candidates.

scattered-teal•11/19/24, 3:57 AM

Looking forward to our call tomorrow.

hurt-tomatoOP•11/19/24, 10:13 AM

Yeah lots of new submissions!

hurt-tomatoOP•11/19/24, 10:13 AM

Deadline was Sunday, so we should not expect new submissions.

scattered-teal•11/19/24, 6:28 PM

Wanted to run an idea by everyone regarding the framing and breakdown of our project deliverables. Given that there is a fair bit of complexity, I propose we break it down into 2 analyses and 1 correlation.
- Life of an AI chip: builds a map/lifecycle of which actors and parties are involved in the supply chain of a typical chip used for training AI.
- Life of a policy: builds a map/lifecycle of a typical policy of a similar nature.
- Building a pause button: combines the two reports into an actionable framework for PauseAI.

The rationale here is to have incremental outputs, such that the first two reports are predominantly fact-based and give us a satisfactory level of expertise on the subject matter to base our final output on. The last report then has the freedom to be a bit more aspirational.
It also gives us a positive reinforcement feedback loop :).
We may be able to find some existing work on the first 2 reports already.

Do we see any merits and/or issues with this approach?

spotty-amber•11/19/24, 7:50 PM

I like the approach! Small clarification on the life of a policy though: my impression is that policies are so wildly varied that they may not lend themselves to a typical lifecycle. Are there any specific examples of policies that we should look into? (I'm no policy expert, just my 0.02c)

spotty-amber•11/19/24, 7:52 PM

Also just spoke with Mitali, she's an undergrad based in California who's excited to join. She doesn't have much professional experience yet but has experience in software engineering and is super interested in learning more about the AI policy and regulation space.

hurt-tomatoOP•11/23/24, 5:08 PM

Open Problems in Technical AI Governance

spotty-amber•11/24/24, 5:41 AM

Small asynch updates from my end:

I read the meeting notes. Just so we don't duplicate efforts, I already got Dominika on board! She is committed to the project

spotty-amber•11/24/24, 5:42 AM

Right now the 2 folks who are committed to joining are Dominika and Jiawei

spotty-amber•11/24/24, 5:42 AM

@Farhan happy to exchange their info if you'd like to schedule a follow up w/ them :]

spotty-amber•11/24/24, 5:42 AM

@Joep Meindertsma

spotty-amber•11/24/24, 5:42 AM

Keep up the good work

scattered-teal•11/24/24, 9:16 AM

Awesome work Jim. I was quite impressed with Dominika, so was hoping she would join. Do you have any notes on the calls?

instant-harlequin•11/25/24, 5:48 PM

Hi guys. Not sure if I should be in this chat room as you are discussing candidates.

I listed this project as my first choice. I guess you are working your way through applicants, and I really hope we can have a chat/interview and that you will see some value in what I can bring as a policy-oriented social scientist/social psychologist to this particular project.

Let me know if it would be better if I was not in this chat room? I don't mind either way!

spotty-amber•11/26/24, 12:23 AM

Yes! Not super detailed since it was mainly a convo but here were my notes on Dominika

spotty-amber•11/26/24, 12:24 AM

- Most interested in 2nd hand chips
- 10 years of experience as PhD/researcher on Russo-Ukraine drone warfare/role of AI in war
- Interested in military/security domain
