Patronus AI is the leading automated AI evaluation and security company. We are on a mission to boost enterprise confidence in generative AI. Our world-class platform enables enterprise development teams to score LLM performance, generate adversarial test cases, benchmark LLMs, and more. Customers use Patronus AI to detect LLM mistakes at scale and deploy AI products safely and confidently.
Our founding team comes from top applied ML and research backgrounds, including Facebook AI Research (FAIR), Airbnb, Meta Reality Labs, and quant finance. As a team, we have published research papers at top ML conferences (NeurIPS, EMNLP, ACL), designed and launched Airbnb’s first conversational AI assistant, pioneered causal inference at Meta Reality Labs, exited a quant hedge fund backed by Mark Cuban, and scaled 0→1 products at high growth startups. We are backed by Lightspeed Venture Partners and high profile operators like Amjad Masad, Gokul Rajaram, and Fortune 500 executives and board members. We are advised by Douwe Kiela, Adjunct Professor at Stanford University and former Head of Research at HuggingFace.
As the QA Engineer for Patronus AI, you are responsible for ensuring the quality and reliability of our AI products and systems. You will work closely with our engineering, AI and design teams to identify and resolve any potential issues or bugs, create testing plans and datasets, ensuring that our products meet the highest standards of performance and functionality. Successful applicants are skilled, have prior QA experience and are detail-oriented.
In this role, you will:
“The number one qualification to succeed in this machine learning course is gumption” - John Lafferty, CS Professor at Yale
Above all, we look for a proactive mindset, willingness to learn, relentless drive, and passion for engineering and product. You are a great fit if you have a background in the following:
Please be informed that the data controller is Patronus AI, Inc., a Delaware corporation. (hereinafter "controller"). Yo...more