• loonsun@sh.itjust.works
    5 hours ago

    It’s about agents, which implies multi-step work, since agents are meant to execute a series of tasks, as opposed to studies that look at base LLM performance.