- Develop, iterate, and refine prompts to optimize LLM outputs, ensuring quality, relevance, and performance across various applications
- Utilize both quantitative and qualitative methodologies to assess LLM performance, identifying areas for improvement and implementing data-driven enhancements
- Collaborate with data scientists, software engineers, and product managers to understand project requirements, define model objectives, and deliver prompt-based solutions that meet diverse user needs
- Conduct experiments, A/B tests, and in-depth analysis to evaluate LLM responses, fine-tuning prompts and strategies to maximize accuracy and minimize errors
- Maintain a detailed knowledge of recent advancements in LLMs, contributing to the teamβs understanding of best practices and emerging capabilities
- Apply creative and flexible thinking to troubleshoot complex prompt engineering challenges, adapting to different contexts and applications as needed
- Proven experience working with LLMs, including Open AIβs GPT, Metaβs LLaMA, or similar platforms
- Proficiency in the quantitative and qualitative evaluation of language model performance, with a strong analytical mindset and attention to detail
- Familiarity with tools and methods for prompt testing, prompt engineering best practices, and LLM optimization
- Excellent problem-solving skills with a creative approach to developing new prompt structures and formats
- Strong communication and collaboration skills, with the ability to work effectively in a team environment
- Background in machine learning, natural language processing, or a related field
- Coding skills in Python or other languages relevant to AI model manipulation and analysis
- A proactive approach to staying updated on advancements in LLMs, machine learning, and AI ethics
TBD