We are a nonprofit raising policymaker awareness of AI risks, aiming to slow potentially dangerous research and accelerate work on technical safety. We have ambitious goals and need to move fast.

Here are the kinds of projects we work on:

Badllama 3: removing safety finetuning from Llama 3 in minutes

FoxVox: one click to alter reality

No publications yet; our vision is discussed in https://docs.google.com/presentation/d/1n5bslNJKMkI2ZoVvwC4N7QiYJDg2IzrzeIjtPPMBPVI/edit#slide=id.p

Here's what your work as an LLM generalist could look like:

  1. Pull a project from our ideas backlog or come up with your own, e.g.:

    I think GPT-4 could do binary exploitation at the level of an OSCP certificate holder. Concretely, I think it could solve all of the level-3 challenges on crackmes.one.

  2. Develop the experiment end-to-end, including its design (how do we measure this?) and the requisite harness (Python code), then run the experiment and collect data (see the sketch after this list).

  3. Write your results up for publication on arXiv, as a landing page, or as a report for policymakers.
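
As an illustration of step 2, here is a minimal sketch of what such a harness might look like. It assumes the OpenAI Python client; the challenges/ directory layout, the prompt fields, and the exact-match scoring rule are hypothetical placeholders for illustration, not our actual setup.

```python
# Minimal harness sketch: query a model on a set of challenge prompts
# and record pass/fail results as JSONL. Paths, prompt format, and the
# scoring rule below are hypothetical placeholders.
import json
from pathlib import Path

from openai import OpenAI  # pip install openai

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def solve(challenge_text: str) -> str:
    """Ask the model for a solution to one challenge."""
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": "You are a reverse-engineering assistant."},
            {"role": "user", "content": challenge_text},
        ],
    )
    return response.choices[0].message.content


def passed(answer: str, expected: str) -> bool:
    """Toy scoring rule: check the expected key appears in the answer.
    A real harness would execute the binary or apply the challenge's
    own acceptance criteria."""
    return expected.strip() in answer


results = []
for path in sorted(Path("challenges").glob("*.json")):  # hypothetical layout
    task = json.loads(path.read_text())
    answer = solve(task["prompt"])
    results.append({"id": task["id"], "passed": passed(answer, task["solution"])})

Path("results.jsonl").write_text("\n".join(json.dumps(r) for r in results))
print(f"solved {sum(r['passed'] for r in results)}/{len(results)}")
```

The point of the sketch is the shape of the work: define a measurable success criterion, automate the model queries, and persist per-task results so the write-up in step 3 can cite concrete numbers.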

Hiring process

  1. Apply with a CV and a cover letter. In the cover letter:
    1. Provide evidence of aptitude for self-directed high-agency work (<150 words)
    2. Provide evidence of exceptional ability (<150 words)
  2. Optional coding test
  3. Interview
  4. Paid trial, 1-2 weeks

Compensation, $/month:

  Intern     $1,000
  Mid-level  $3,000
  Senior     $5,000

We expect to hire 2-4 full-time contractors in this round. You can join us remotely or in our Berkeley or Tbilisi offices.