OpenAI

1590 Articles
KGGen: Advancing Knowledge Graph Extraction with Language Models and Clustering Techniques
OpenAI

KGGen: Advancing Knowledge Graph Extraction with Language Models and Clustering Techniques

Knowledge graphs (KGs) are the foundation of artificial intelligence applications but are incomplete and sparse, affecting their effectiveness. Well-established KGs such as DBpedia...

Microsoft Researchers Present Magma: A Multimodal AI Model Integrating Vision, Language, and Action for Advanced Robotics, UI Navigation, and Intelligent Decision-Making
OpenAI

Microsoft Researchers Present Magma: A Multimodal AI Model Integrating Vision, Language, and Action for Advanced Robotics, UI Navigation, and Intelligent Decision-Making

Multimodal AI agents are designed to process and integrate various data types, such as images, text, and videos, to perform tasks in digital...

Learning Intuitive Physics: Advancing AI Through Predictive Representation Models
OpenAI

Learning Intuitive Physics: Advancing AI Through Predictive Representation Models

Humans possess an innate understanding of physics, expecting objects to behave predictably without abrupt changes in position, shape, or color. This fundamental cognition...

Advancing MLLM Alignment Through MM-RLHF: A Large-Scale Human Preference Dataset for Multimodal Tasks
OpenAI

Advancing MLLM Alignment Through MM-RLHF: A Large-Scale Human Preference Dataset for Multimodal Tasks

Multimodal Large Language Models (MLLMs) have gained significant attention for their ability to handle complex tasks involving vision, language, and audio integration. However,...

DeepSeek AI Introduces NSA: A Hardware-Aligned and Natively Trainable Sparse Attention Mechanism for Ultra-Fast Long-Context Training and Inference
OpenAI

DeepSeek AI Introduces NSA: A Hardware-Aligned and Natively Trainable Sparse Attention Mechanism for Ultra-Fast Long-Context Training and Inference

In recent years, language models have been pushed to handle increasingly long contexts. This need has exposed some inherent problems in the standard...

Moonshot AI Research Introduce Mixture of Block Attention (MoBA): A New AI Approach that Applies the Principles of Mixture of Experts (MoE) to the Attention Mechanism
OpenAI

Moonshot AI Research Introduce Mixture of Block Attention (MoBA): A New AI Approach that Applies the Principles of Mixture of Experts (MoE) to the Attention Mechanism

Efficiently handling long contexts has been a longstanding challenge in natural language processing. As large language models expand their capacity to read, comprehend,...

Microsoft AI Releases OmniParser V2: An AI Tool that Turns Any LLM into a Computer Use Agent
OpenAI

Microsoft AI Releases OmniParser V2: An AI Tool that Turns Any LLM into a Computer Use Agent

In the realm of artificial intelligence, enabling Large Language Models (LLMs) to navigate and interact with graphical user interfaces (GUIs) has been a...

Mistral AI Introduces Mistral Saba: A New Regional Language Model Designed to Excel in Arabic and South Indian-Origin Languages such as Tamil
OpenAI

Mistral AI Introduces Mistral Saba: A New Regional Language Model Designed to Excel in Arabic and South Indian-Origin Languages such as Tamil

As artificial intelligence (AI) continues to gain traction across industries, one persistent challenge remains: creating language models that truly understand the diversity of...