ML/NLP specialist

We are a nonprofit raising policymaker awareness about AI risks, aiming to slow potentially dangerous research and accelerate technical safety. We have ambitious goals and need to move fast. Come work with us! About Palisade

Here's what your work as a ML/NLP specialist could look like:

Build a Whisper + Llama adversarial attack and transfer it to Whisper + GPT.
Reproduce a paper and extend on it, e.g. https://arxiv.org/abs/2408.00761#.
Refactor and open-source Badllama evals code (see https://arxiv.org/abs/2407.01376v1, section 5).

Our collaboration process:

We post daily statuses for each other to keep in sync regarding our directions. Each project has two sync meetings per week to keep it on track; we have an all-hands demo meet every two weeks.
We propose new ideas or directions by writing up a doc, sharing it, and getting comments. This enables async communication.
Our median response time to each other is in hours, not minutes; we work in an independent and self-directed fashion. Your supervisor helps you maintain direction; colleagues help with the implementation; you keep track of your tasks and milestones.

Here are the key traits one needs to succeed in this role:

Excellent Python and NLP ecosystem proficiency: it's important for coding to not get into your way while doing research.
Strong writing skills.
Aptitude for self-directed, high-agency work. You take initiative and contribute proactively; we don’t micromanage.
Aptitude for cross-functional collaboration and learning. You do what it takes to ship your work.
Motivation to conduct research that is both curiosity-driven and addresses concrete open questions in AI risk.

Hiring process

Apply with a CV and a cover letter. In the cover letter:
1. Provide evidence of aptitude for self-directed high-agency work (<150 words)
2. Provide evidence of exceptional ability (<150 words)
Optional coding test
Interview
Paid trial, 1-2 weeks

	Compensation, $/mo
Intern	$1000
Middle	$3000
Senior	$5000

This is a remote position. Apply with this form.