Redwood City, October 23, 2024. Drawing on its deep knowledge of the brain and memory, the internal AI team of the Tianqiao & Chrissy Chen Institute (TCCI) has achieved a major breakthrough in artificial intelligence: its self-developed OMNE Multiagent Framework has taken the top position on the GAIA (General AI Assistants) benchmark leaderboard (https://huggingface.co/spaces/gaia-benchmark/leaderboard), published by Hugging Face. OMNE outperformed frameworks from some of the world’s leading institutions, including Microsoft Research. The achievement builds on years of brain research at TCCI and equips agents with Long-Term Memory (LTM) capabilities, which enable the framework to engage in deeper, slower thinking and enhance the decision-making of Large Language Models (LLMs) in complex problem-solving.
This milestone is a major accomplishment for TCCI’s AI team, coming after the institute’s founder, Chinese tech entrepreneur Tianqiao Chen, announced the “All-In AI Strategy” last year.
OMNE currently boasts an overall success rate of 40.53%, surpassing submissions from well-known institutions such as Meta, Microsoft, Hugging Face, Princeton University, the University of Hong Kong, the British AI Safety Research Institute, and Baichuan, among others. In comparison, GPT-4 equipped with plugins achieved a success rate of only 15%.
GAIA, co-launched by Meta AI, Hugging Face, and AutoGPT, is a benchmarking system designed to rigorously test AI assistants on real-world challenges. It evaluates core competencies such as Reasoning, Multi-Agent Coordination, Web Browsing, and Tool Usage. GAIA is one of the most demanding datasets for multi-agent intelligence, and topping its leaderboard showcases the depth of Shanda’s AI expertise and its ability to push the boundaries of innovation.
OMNE is a multi-agent collaboration framework based on long-term memory (LTM). Each agent has an identical but independent system structure and can autonomously learn and understand a complete world model, allowing it to understand its environment independently. This LTM-based multi-agent collaboration system enables the AI system to adapt to changes in individual behavior in real time, optimize task planning and execution, and promote personalized and efficient self-evolution.
The major breakthrough is the integration of a long-term memory mechanism, which greatly reduces the search space of Monte Carlo Tree Search (MCTS) and improves decision-making on complex problems. By introducing more efficient logical reasoning, OMNE not only raises the intelligence level of a single agent but also significantly enhances the overall capabilities of the multi-agent system by optimizing the collaboration mechanism. This enhancement mechanism is inspired by studies of the columnar structure of the human cerebral cortex. As the basic unit of the brain’s cognitive and behavioral functions, the cortical column processes information through a complex collaboration mechanism. By strengthening both the intelligence of individual agents and the collaboration among them, the AI model may gradually develop emergent cognitive abilities, build an internal representation model, and in turn drive a leap in the overall intelligence of the system.
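As a rough illustration only, the Python sketch below shows one way remembered outcomes could shrink a search tree: actions that memory rates poorly are dropped before the tree is expanded. It is not OMNE's implementation, which this release does not describe. The class `LongTermMemory`, the functions `prune_actions` and `tree_size`, and the action names are all hypothetical, and a simple smoothed success rate stands in for whatever scoring a real system would use.

```python
from collections import defaultdict


class LongTermMemory:
    """Hypothetical store of how often each action helped in past tasks."""

    def __init__(self):
        self.successes = defaultdict(int)
        self.attempts = defaultdict(int)

    def record(self, action, succeeded):
        # Remember one past use of an action and whether it worked.
        self.attempts[action] += 1
        if succeeded:
            self.successes[action] += 1

    def score(self, action):
        # Laplace-smoothed success rate; an unseen action scores 0.5.
        return (self.successes[action] + 1) / (self.attempts[action] + 2)


def prune_actions(memory, actions, keep):
    """Keep only the `keep` actions that memory rates highest."""
    ranked = sorted(actions, key=memory.score, reverse=True)
    return ranked[:keep]


def tree_size(branching, depth):
    """Number of leaf nodes in a full tree with the given branching factor."""
    return branching ** depth


memory = LongTermMemory()
past_outcomes = [
    ("search_web", True), ("search_web", True),
    ("read_file", True),
    ("guess", False), ("guess", False),
]
for action, succeeded in past_outcomes:
    memory.record(action, succeeded)

actions = ["guess", "search_web", "read_file", "ask_user", "run_code"]
kept = prune_actions(memory, actions, keep=2)

print(kept)  # ['search_web', 'read_file']
print(tree_size(len(actions), 4), "->", tree_size(len(kept), 4))  # 625 -> 16
```

In this toy setting, cutting five candidate actions to two at each of four decision steps reduces the full tree from 625 leaves to 16. A real MCTS would sample the tree rather than enumerate it, but the effect of a smaller branching factor is the same in kind.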
“We are incredibly proud to see OMNE top the GAIA leaderboard,” said the head of TCCI’s AI team. “This achievement demonstrates the vast potential of using long-term memory to drive AI self-evolution and solve real-world problems. We believe that advancing research in Long-Term Memory and AI self-evolution is crucial for the ongoing development and practical application of AI technologies.”
About the Tianqiao and Chrissy Chen Institute
The Tianqiao and Chrissy Chen Institute (the Chen Institute) was created in 2016 by Tianqiao Chen and his wife Chrissy Luo with a US $1 billion commitment to help advance brain science. The organization’s vision is to improve the human experience by understanding how our brains perceive, learn, and interact with the world.
The Chen Institute created the Tianqiao and Chrissy Chen Institute for Neuroscience at Caltech in 2016 and the Tianqiao Chen Institute for Translational Research, a partnership with the Shanghai Zhou Liangfu Medical Development Foundation, Huashan Hospital, and Shanghai Mental Health Center, in 2017. The Chen Institute opened the Chen Frontier Lab for Applied Neurotechnology in 2020 and the Chen Frontier Lab for AI and Mental Health in 2021. In early 2023, the Chen Institute launched the Chen Scholars program, which supports early- to mid-career physician scientists. The Institute has a strong focus on artificial intelligence because of its ability to accelerate the pace of scientific research.
Learn about the Chen Institute and Science Prize for AI Accelerated Research at ChenInstitute.org/Prize and follow our news at ChenInstitute.org, LinkedIn, or via X @ChenInstitute.
For information, please contact us at: contactus@cheninstitute.org
For more information on the GAIA benchmark, visit the GAIA Benchmark Results page: https://huggingface.co/spaces/gaia-benchmark/leaderboard
TCCI’s paper on AI Long-Term Memory, “Long Term Memory: The Foundation of AI Self-Evolution”, has been published on arXiv: https://arxiv.org/abs/2410.15665