Research Scientist
FAR.AI
📍Remote (Global) 🕔 Full Time
💰$80,000-$175,000 USD /year 🔄 Rolling Applications
We are seeking expressions of interest from both experienced and first-time Research Scientists to develop and execute a safety research agenda and/or accelerate our existing projects. We will review all submissions, but we can currently only reach out to candidates who are a strong fit for the role. We will keep all other submissions on file and reach out when we can interview more candidates. Thank you!
About the Role
We are seeking applications from potential Research Scientists who can:
Take ownership of and accelerate existing AI Alignment research agendas.
Develop their own exciting AI Alignment research agendas.
Lead novel research projects where there may be unclear markers of progress and/or success.
Contribute to the development of best practices for AI safety research at FAR.AI and in the broader community.
Publish research findings and engage with the AI safety community.
About You
We are excited by unconventional backgrounds. You may have the following:
New and under-explored AI Alignment idea(s).
Experience leading and/or playing a senior role in research projects related to machine learning.
Ability to effectively communicate novel methods and solutions to both technical and non-technical audiences.
PhD or several years of research experience in computer science, artificial intelligence, machine learning, or statistics.
About the Projects
As a Research Scientist, you would lead AI safety research projects or make essential contributions to existing projects. Examples of ongoing projects at FAR.AI include:
Scaling laws for prompt injections. Will advances in capabilities from increasing model and data scale help resolve prompt injections or “jailbreaks” in language models, or is progress in average-case performance orthogonal to worst-case robustness?
Robustness of advanced AI systems. Explore adversarial training, architectural improvements and other changes to deep learning systems to improve their robustness. We are exploring this both in zero-sum board games and language models.
Mechanistic interpretability for mesa-optimization. Develop techniques to identify internal planning in models to effectively audit the “goals” of models in addition to their external behavior.
Red-teaming of frontier models. Apply our research insights to test for vulnerabilities and limitations of frontier AI models prior to deployment.
Logistics
You will be an employee of FAR.AI, a 501(c)(3) research non-profit.
Location: Both remote and in-person (Berkeley, CA) are possible. We sponsor visas for in-person employees, and can also hire remotely in most countries.
Hours: Full-time (40 hours/week).
Compensation: $100,000-$175,000/year depending on experience and location. We will also pay for work-related travel and equipment expenses. We offer catered lunch and dinner at our offices in Berkeley.
Application Process: A 72-minute programming assessment, a short screening call, two 1-hour interviews, and a 1-2 week paid work trial. If you are not available for a work trial, we may be able to find alternative ways of testing your fit.
If you have any questions about the role, please do get in touch at talent@far.ai.