About this project
Open
We are seeking an experienced AI/ML engineer or team to design and build a robust learning environment on a 'substrate' foundation. The environment will be used to train a Quality Assurance (QA) AI agent through reinforcement learning from human feedback (RLHF). The project covers development of the core infrastructure, implementation of the RLHF mechanism, and integration of rubrics for human feedback and evaluation. The ideal freelancer will have a strong background in artificial intelligence, machine learning, and system architecture, with a focus on building scalable and efficient learning systems. Deliverables include the complete environment setup, documentation, and a functional prototype of the QA AI agent's learning loop.
Category: IT & Programming
Subcategory: Artificial Intelligence
Project size: Large
Delivery term: Not specified
Skills needed