AI Edition - It's AI Time - 296 [05-02-2025]

Hugging Face Replicating OpenAI's Deep Research

In a 24-hour experiment, Hugging Face attempted to replicate OpenAI's Deep Research, an agentic web-search framework that significantly improved performance on the GAIA benchmark, with the aim of open-sourcing an equivalent system.

Google CEO on DeepSeek vs. Gemini

Sundar Pichai has downplayed the efficiency of DeepSeek's AI models, arguing that Google's Gemini models, particularly Gemini 2.0 Flash, outperform them despite DeepSeek's disruptive impact on the AI market.

US Copyright Office rules out copyright for AI-created content without human input

The US Copyright Office states that AI-generated works without human intervention cannot be copyrighted. AI tools assisting with creativity, like de-aging actors, won't limit copyright protection, but purely generative AI outputs require further analysis.

Open-Vocabulary Detection with LLMs

LLMDet is an open-vocabulary detector that leverages a large language model for enhanced caption generation and grounding, significantly boosting performance compared to existing detectors.

Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Learning-rate schedules for large models closely match theoretical bounds from non-smooth convex optimization. The authors provide a bound for the constant schedule with linear cooldown; the practical benefit of cooldown shows up in the bound as the absence of logarithmic terms. Their findings enabled practical improvements in training Llama-type models through optimal learning-rate extension and cross-schedule transfer.
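
For readers unfamiliar with the schedule shape mentioned above, here is a minimal sketch of a constant learning rate followed by a linear cooldown to zero. It is an illustration only, not the authors' code: the function name, the parameter names, and the default values (base rate 1e-3, cooldown over the last 20% of steps) are assumptions chosen for the example.

```python
def constant_with_linear_cooldown(step, total_steps, base_lr=1e-3, cooldown_frac=0.2):
    """Learning rate at `step` for a constant schedule with linear cooldown.

    All names and defaults here are hypothetical, picked for illustration.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if not 0 <= step <= total_steps:
        raise ValueError("step must lie in [0, total_steps]")
    if not 0.0 <= cooldown_frac <= 1.0:
        raise ValueError("cooldown_frac must lie in [0, 1]")

    # Number of steps spent in the cooldown phase at the end of training.
    cooldown_steps = round(total_steps * cooldown_frac)
    cooldown_start = total_steps - cooldown_steps

    # Constant phase: hold the base learning rate.
    if cooldown_steps == 0 or step < cooldown_start:
        return base_lr

    # Cooldown phase: decay linearly from base_lr down to 0 at total_steps.
    remaining = total_steps - step
    return base_lr * remaining / cooldown_steps


if __name__ == "__main__":
    for s in (0, 500, 800, 900, 1000):
        print(s, constant_with_linear_cooldown(s, total_steps=1000))
```

With these assumed defaults and 1000 total steps, the rate stays at the base value through step 800 and then falls linearly, reaching zero at the final step.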
