Anthropic introduces 'routines' in Claude Code, letting developers automate and schedule coding tasks that run without direct interaction or an active session.
A new framework evaluates the impact of skills on agent performance, showing a 20% accuracy boost and improved cost efficiency, while highlighting how hard skills are to evaluate.
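For a concrete sense of what such a comparison involves, here is a minimal sketch of a with/without-skill evaluation loop; `run_agent`, the task list, and the result fields are assumptions for illustration, not the framework's actual API.

```python
# Minimal sketch of a with/without-skill evaluation loop. `run_agent`,
# TASKS, and the result fields are hypothetical stand-ins, not any
# specific framework's API.
from statistics import mean

TASKS = [
    "Fix the failing test in utils.py",
    "Add pagination to the /items endpoint",
]

def run_agent(prompt: str, skills: list[str]) -> dict:
    # Stand-in for a real agent invocation: wire this to your runtime.
    # Here it fakes a result so the script runs end to end.
    return {"success": bool(skills), "cost_usd": 0.05 + 0.01 * len(skills)}

def evaluate(skills: list[str]) -> tuple[float, float]:
    results = [run_agent(task, skills) for task in TASKS]
    accuracy = mean(1.0 if r["success"] else 0.0 for r in results)
    cost = sum(r["cost_usd"] for r in results)
    return accuracy, cost

base_acc, base_cost = evaluate(skills=[])
skill_acc, skill_cost = evaluate(skills=["code-review"])
print(f"accuracy {base_acc:.0%} -> {skill_acc:.0%}, cost ${base_cost:.2f} -> ${skill_cost:.2f}")
```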
Vercel open-sources Open Agents, a platform for building custom AI coding agents that addresses the limitations of generic tools in large codebases.
AI Engineer Europe highlighted the infrastructure gap in agent deployment, with teams struggling to manage and evaluate skills effectively in production environments.
GitHub introduces remote control for Copilot CLI, allowing users to manage terminal sessions from the web or a mobile device, reflecting a shift in how AI coding agents are used.
GitHub pauses new Copilot Pro trials and tightens usage limits in response to increased demand, reflecting the broader capacity challenges facing AI tool providers.
An article shares lessons learned from evaluating an AI PR reviewer, highlighting the importance of risk classification and of addressing false positives.
Factory launches a desktop app for its AI 'Droids,' enhancing software development with persistent environments and expanded agent interactions on macOS and Windows.
Anthropic's Claude Managed Agents offers a hosted platform for running AI agents in production, reducing both infrastructure overhead and development time.
An AI PR reviewer achieves 97.7% accuracy by assembling evidence packs and risk classifications that help human reviewers make decisions, rather than hunting for bugs directly.
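As a rough illustration of that pattern (all names here are hypothetical), an evidence pack might pair a finding with its supporting context and a risk level, leaving the verdict to a human:

```python
# Hypothetical sketch of the "evidence pack + risk classification" idea:
# instead of asserting "this is a bug," the reviewer assembles evidence
# and a risk level so a human can decide. All names are illustrative.
from dataclasses import dataclass, field
from enum import Enum

class Risk(Enum):
    LOW = "low"
    MEDIUM = "medium"
    HIGH = "high"

@dataclass
class EvidencePack:
    finding: str               # what the reviewer noticed
    diff_hunk: str             # the relevant lines from the PR
    supporting_context: list[str] = field(default_factory=list)  # e.g. callers, tests
    risk: Risk = Risk.LOW

    def summary(self) -> str:
        return f"[{self.risk.value.upper()}] {self.finding} ({len(self.supporting_context)} supporting items)"

pack = EvidencePack(
    finding="timeout parameter removed from retry helper but still passed by two callers",
    diff_hunk="-def retry(fn, timeout=30):\n+def retry(fn):",
    supporting_context=["services/sync.py:88", "services/ingest.py:142"],
    risk=Risk.HIGH,
)
print(pack.summary())
```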
GitHub's 'Rubber Duck' feature in Copilot CLI uses a second AI model to review code, offering a fresh perspective and identifying potential issues early in development.
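The underlying pattern is straightforward to sketch: route the code to a second, independent model for critique. The `complete` function below is a hypothetical stand-in, not Copilot CLI's internals.

```python
# Generic sketch of the two-model cross-check pattern behind features like
# Rubber Duck: a second, independent model critiques code the first model
# produced. `complete` is a hypothetical model client, not Copilot CLI's
# actual internals.
def complete(model: str, prompt: str) -> str:
    # Stand-in for a chat-completion call; returns a canned review here
    # so the example runs without a real provider.
    return "retry() drops its timeout argument; callers at sync.py:88 still pass one."

def rubber_duck_review(code: str, reviewer_model: str = "reviewer-model") -> str:
    prompt = (
        "You are reviewing code written by another model. "
        "List concrete issues (bugs, edge cases, security) with line references.\n\n"
        + code
    )
    return complete(reviewer_model, prompt)

print(rubber_duck_review("def retry(fn):\n    return fn()"))
```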
Inspecting your agent sessions with tools like Tessl's behavior-audit skill can help you optimize skills, surfacing the friction points and verifiers that show up during real-world usage.
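A minimal sketch of that kind of audit, assuming a JSONL session log; the event shape here is an illustrative assumption, not Tessl's actual behavior-audit format.

```python
# Illustrative session-log audit: count failed tool calls per skill as a
# rough friction signal. The JSONL event shape is an assumption, not
# Tessl's actual behavior-audit format.
import json
from collections import Counter

def audit(session_path: str) -> Counter:
    friction = Counter()
    with open(session_path) as f:
        for line in f:
            event = json.loads(line)
            # A tool call that errored is treated as a friction signal
            # worth reviewing when tuning the skill.
            if event.get("type") == "tool_call" and event.get("status") == "error":
                friction[event.get("skill", "unknown")] += 1
    return friction

# Example: point this at an exported session transcript.
for skill, errors in audit("session.jsonl").most_common():
    print(f"{skill}: {errors} failed tool calls")
```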