AI Edition - It's AI Time - 199 [31-10-2024]
AI Edition - It's AI Time - 199 Pushing the Frontiers of Audio Generation DeepMind has talked a little bit more about the audio generation models used to power NotebookLM. OpenAI's new hallucination benchmark OpenAI has released the SimpleQA benchmark, which measures models' abilities around simple factual questions. Evaluating feature steering: A case study in mitigating social biases The feature steering in AI models to interpretably modify outputs. It reveals a "steering sweet spot", where changes do not degrade capabilities. The study results show steering can alter social bias in targeted domains but also brings unexpected off-target effects. Further research is required to refine feature steering for safer, more reliable outcomes in AI models.