§01

>> /vagas · tech: reinforcement learning from feedback

// vagas filtradas por tech = "reinforcement learning from feedback". remover filtro

STATUSPOSTADALOCALNÍVELVAGAEMPRESA
OPENhá 10 diasREMOTOPLai research engineer (agentic post-training) - 100% remote worldwide // distributed training frameworks · tethertether