Cisco Releases Open Source Cybersecurity LLM Trained by Ex-Google and Meta Experts

Cisco´s Foundation AI group, led by talent from Meta and Google, has open-sourced a compact cybersecurity large language model based on Llama 3, aiming to boost automation and accessibility for enterprise security.

Cisco´s new Foundation AI group has introduced a specialized large language model (LLM) for cybersecurity, open-sourced to include public access to its weights. The initiative is led by Yaron Singer, a former Harvard computer science professor and CEO of Robust Intelligence, which Cisco acquired in 2024. Singer, now Cisco´s vice president of Artificial Intelligence and security, assembled a team of engineers from Meta and Google to develop this LLM, training it on Meta´s Llama 3 and focusing exclusively on cybersecurity data.

The model, publicly released with open weights, is designed to address the unique challenges of cybersecurity data, which often lacks natural language structure and rapidly evolves with emerging threats and vulnerabilities. The Foundation AI group distilled open source data from 200 billion tokens down to a 5-billion-token dataset specifically relevant for cybersecurity. The resulting model is compact, capable of running on a single Nvidia A100 GPU for on-premises use—a notable cost and efficiency advantage for organizations. Cisco plans to integrate this LLM into its extended detection and response (XDR) product lineup, which now includes Artificial Intelligence-powered agents for attack verification and forensics, as well as visualization tools.

Industry analysts note the model´s potential for further customization using retrieval-augmented generation, making it attractive for organizations seeking tailored security tools without incurring prohibitive costs from larger vendors. However, some experts warn of challenges facing agentic Artificial Intelligence for security operations centers (SOCs), including scalability, compute costs, and alert processing rates. While the LLM could speed up SOC alert triage compared to human analysts, current estimates show that even with improved speeds, scalability and energy consumption remain concerns. There is also the inherent risk of open source models being used by adversaries to identify vulnerabilities, although proponents argue that similar tools are already in use by security practitioners within enterprises.

The emergence and release of domain-specific LLMs like Cisco´s marks a trend toward greater specialization as the initial wave of foundational Artificial Intelligence models reaches maturity. As more organizations experiment with open, fine-tunable security models in operational environments, the field is likely to see continued innovation and possibly increased mergers or acquisitions as their practical value is proven.

73

Impact Score

Regulators use Artificial Intelligence to scrutinize disclosures

US, UK, and European regulators are using or exploring Artificial Intelligence tools to detect disclosure problems and monitor firms more effectively. Compliance specialists say supervisors may now be ahead of financial institutions in some areas of technological sophistication.

Pope Leo frames Artificial Intelligence as a media power struggle

Pope Leo XIV’s first encyclical casts Artificial Intelligence as a moral question of power, labor, and collective responsibility, offering publishers a framework for negotiating with technology companies. The broader media landscape is also shifting as AP supplies election data to ChatGPT, YouTube expands labeling of Artificial Intelligence video, and search traffic declines for publishers.

Why the U.S. leads Europe in Artificial Intelligence adoption

Survey evidence shows U.S. workers and firms are adopting Artificial Intelligence faster than their European counterparts. The gap appears to be driven not only by workforce composition, but also by stronger managerial support and greater workplace encouragement to use the technology.

FluxMem brings dynamic memory to large language model agents

FluxMem reframes memory for large language model agents as a dynamic graph that evolves with feedback, task variation, and long-term use. The approach is designed to reduce the brittleness of static memory systems and improve reliability in complex environments.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.