Arize is an AI engineering platform focused on evaluation and observability. It helps engineers develop, evaluate, and observe AI applications and agents.
Arize AI
Laurie Voss, head of DevRel at Arize, explains why separating those layers makes the whole category easier to reason about.
arize.com/blog/what-i...
Cursor users! A dozen incredibly helpful Arize skills are now available directly in Cursor from the Agent Marketplace! Select "Customize" from the agents sidebar to see the marketplace and click to get them automatically installed.
cursor.com/marketplace...
That unlocks questions like:
- Which customer segments see the most failed agent runs?
- Which tool calls add the most latency?
- Did our latest prompt change actually improve production behavior?
arize.com/blog/arize-...
Most agent orchestration debates are arguing about the wrong layer. 👀
Frameworks answer how agent control flow is expressed. Runtimes answer how agents recover, resume, and survive long tasks. Observability answers how teams find out what actually happened.
We're sponsoring Londonmaxxing 003: a one-day hackathon in Dalston on July 4th about making London better to live in and build in. Less "disrupt the city," more "fix the city." Credits, lunch, £1k+ prize pool. Applications open: luma.com/maxxing-london
Apple paid Google ~$1B/yr to license memory for Siri. OpenAI rebuilt ChatGPT memory in place. Anthropic gave models an API to consolidate their own. All called "memory." None is what users mean. @jimbobbennett.dev wrote a field map: arize.com/blog/memory...
Psst - we’re also going to be at the Data + AI Summit by Databricks! @ehutt_ (who's behind Phoenix) is speaking on Agent as a Judge, AI error analysis, and scaling evaluation for agent apps.
RSVP here: app.ingo.me/q/gqiit
#DataAISummit