RAGRecon, a system to improve Cyber Threat Intelligence through the integration of Large Language Models and Retrieval-Augmented Generation.
Category: AI and Language Model Research
Research on Artificial Intelligence, Machine Learning, and Language models.
RAGRecon | LLMs For Explainable Cyber Threat Intelligence
RAGRecon, a system to improve Cyber Threat Intelligence through the integration of Large Language Models and Retrieval-Augmented Generation.
DRAFT-RL | First LLM Evaluation Framework to Integrate Structured Reasoning with Multi-Agent RL
DRAFT-RL is a evaluation framework fort LLMs designed to address critical limitations in LLM-based reasoning systems by integrating Chain-of-Draft (CoD) reasoning with multi-agent reinforcement learning.
Language Model Council | 20 LLMs Dethroned GPT-4o and Revealed the Flaws in AI Leaderboards
LLM evaluation benchmarks aren’t as objective as they seem. What LLM picked as the LLM as a Judge can dramatically change the outcome of the evaluation. However, the Language Model Council research suggests that the top spot on any given leaderboard might be an artifact of evaluation design rather than a reflection of superior, generalized capability.
Is This “Humanity’s Last Exam”… For Language Models?
Humanity’s Last Exam is a multi-modal case study designed to measure the capabilities of large language models.
Humains-Junior Language Model Challenges GPT-4o on Factual Accuracy
A new research paper from Humains-Junior language model reportedly matches the factual accuracy of GPT-4o on a specific public subset. According to the paper the Humains-Junior language model achieves this performance through an innovative method called “Exoskeleton Reasoning.”
