Founders Fund, Pantera, and Franklin Templeton Join Sentient's "Arena" to Stress-Test Enterprise-Grade AI Agents

By: rootdata|2026/03/21 23:26:59


Over the past two years, enterprises have accelerated the integration of AI agents into real workflows, from customer service and back-office operations to financial and compliance processes that involve high-stakes decisions. As these systems become more deeply embedded in day-to-day business operations, a new problem has emerged: although agents can retrieve information, they often struggle to deliver stable, interpretable, and reproducible reasoning when the work becomes "messy," multi-step, or high-risk.

Today, the open-source AI lab Sentient officially launched Arena, a live, production-ready environment in which thousands of AI developers worldwide can stress-test their agents and compete iteratively on the hardest reasoning problems enterprises face. The initial participants in Arena's first phase include Founders Fund, Pantera, and Franklin Templeton, which manages more than $1.5 trillion in assets. Their involvement signals early and clear institutional interest in "structured evaluation of AI agents before deployment."

"As enterprises apply AI agents to research, operations, and customer-facing workflows, the question is no longer whether these systems are powerful enough, but whether they are reliable in real workflows," said Julian Love, Managing Partner at Franklin Templeton Digital Assets. Love added that structured environments such as Arena will help the industry distinguish between "promising ideas" and "capabilities that can actually be used in production."

Sentient co-founder Himanshu Tyagi said: "AI agents are no longer just experiments inside enterprises; they are entering critical processes that affect customers, financing, and operational outcomes." This shift changes the evaluation criteria. It is not enough for systems to look impressive in demos. Enterprises need to know whether agents can keep reasoning reliably in production environments, where the cost of failure is high and trust is fragile. They need comparability, repeatability, and a way to track reliability improvements over the long term that does not depend on the underlying model or toolset.

Arena simulates the real chaos of enterprise workflows: incomplete information, long context, vague instructions, and conflicting sources. Rather than only checking whether agents produce "correct answers," Arena records the full reasoning trace so that engineering teams can identify the causes of failures and validate improvements over time.

This provides a neutral, vendor-independent benchmark for evaluating reasoning across models and technology stacks. Arena emphasizes production-level performance over demo performance, which allows it to build verifiable agent capabilities that apply to high-stakes situations and that enterprises can also transfer to their own private data and internal tools.

In the first challenge, developers joining Arena will focus on a core enterprise problem: document reasoning. AI agents must reason over and process complex, unstructured data, a type of work that underpins scenarios such as financial analysis, root-cause investigation, investment memo drafting, and customer service.

Other participants in the initial phase include alphaXiv, Fireworks, OpenHands, and OpenRouter. More are expected to join as Arena expands across tasks, industries, and model integrations.

Recent research also highlights the gap Arena aims to address: 85% of enterprises say they want to become "agentic enterprises," and nearly three-quarters plan to deploy autonomous agents, yet fewer than a quarter actually have mature governance systems in place. Many enterprises struggle to scale pilot projects into large-scale production deployments. On average, enterprises use around a dozen agents, often scattered across isolated use cases, and many believe that without better coordination and collaboration capabilities, adding more agents will only increase complexity and reduce value.

"At OpenHands, we have always been eager to help developers use agents to solve real, practical problems," said Graham Neubig, Chief Scientist and co-founder of OpenHands. "We are also excited to help participants use the OpenHands software agent SDK to tackle these complex challenges."

Alex Atallah, co-founder and CEO of OpenRouter, said: "Arena is exactly the kind of initiative that can drive open-source AI forward, because it lets researchers compete, iterate, and innovate in an open environment. We look forward to deepening our collaboration with Sentient and providing the infrastructure needed to make experiments faster and easier to scale."

Arena will launch globally and invite thousands of AI developers to apply for the first limited cohort, with in-person events scheduled to take place in San Francisco starting in March 2026.

About Sentient Labs

Sentient Labs is a leading technology research and product development organization dedicated to advancing open-source artificial intelligence. As the innovation engine of the Sentient Foundation, Sentient Labs conducts cutting-edge research in AI agent reasoning, alignment, and collaboration. Sentient is the lead developer of high-performance frameworks such as ROMA and open-source models such as Dobby. Sentient's mission is to move open-source AI from an "experiment" to a "necessity." By providing the infrastructure needed to build powerful, composable agent systems, Sentient enables developers to commercialize open-source tools and achieve enterprise-grade usability. Sentient is committed to making open source the default standard for critical AI operations worldwide.
