Recent Articles

Collective Communication Profiling: Unveiling GPU Interconnect Bottlenecks in LLMs
Collective communication analysis is essential, not optional, for ensuring high-throughput, resilient deployment of today's and tomorrow's LLM workloads.
By ds66 · 2024-11-09
5C Prompt Contracts: A Minimalist, Creative-Friendly, Token-Efficient Prompt Framework for Individuals & SMEs
The 5C Prompt Contract offers a simple yet robust prompt design framework—perfectly suited for solo creators and SMEs.
By ds66 · 2024-11-09
Enhancing Food-Domain Question Answering with a Multimodal Knowledge Graph: Hybrid QA Generation and Diversity Analysis
This paper marks a major advance in food-centric AI: by integrating a large multimodal KG, hybrid QA generation, and joint text–image fine-tuning…
By ds66 · 2024-11-08
CogniSQL‑R1‑Zero: Reinforced Reasoning for Efficient, High-Fidelity Text-to-SQL
This work emphasizes that aim, alignment, and simplicity together constitute a new paradigm for efficient, responsible system design.
By ds66 · 2024-11-08
Lights, Camera, Language Models: Evaluating GPT‑4o, Gemini‑2.0 & DeepSeek‑V3 for Movie Review Generation 🎬
This in-depth study demonstrates that LLMs are now fluent enough to craft structurally coherent, sentiment-laced movie reviews…
By ds66 · 2024-08-05
Insights into DeepSeek‑V3: Tackling Scaling Challenges with Hardware–Model Co‑Design
Its ISCA paper offers a roadmap: hardware and model architects must collaborate closely to break the next frontier in AI scale.
By ds66 · 2024-08-04
Benchmarking GPT‑4.0 vs DeepSeek‑V3 for Code-Smell Detection: Accuracy, Cost, and Practical Guidance
Our benchmarking positions LLMs as powerful additions to code-quality ecosystems. GPT‑4.0 and DeepSeek‑V3 both significantly outperform static analyzers on nuanced smell detection…
By ds66 · 2024-08-02
DeepSeek‑V3, GPT‑4, Phi‑4, and LLaMA‑3.3: Automating LoRaWAN Engineering with LLM Code Generation
This study underscores that LLMs—large and lean—can reliably generate domain‑specific engineering code.
By ds66 · 2024-07-29
DeepSeek‑V3 Technical Report: Redefining Efficient Language Model Training
DeepSeek‑V3 stands out as a pivotal demonstration that smarter architectures trump bigger budgets. Through MLA, MoE routing, FP8 precision, and network-aware designs…
By ds66 · 2024-07-28
Argument Mining with Large Language Models: An Extensive Evaluation from LLAMA to GPT-4o and DeepSeek-R1
As LLMs continue to evolve, tools for interpretable and domain-specific argument analysis will become vital across law, journalism…
By ds66 · 2024-07-27
Bridging Technology and Humanities: Evaluating DeepSeek‑R1 in Social Sciences Research
DeepSeek‑R1 stands as a pioneering example of reasoning-capable LLMs tailored to the humanities and social sciences.
By ds66 · 2024-07-26
Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek‑R1, and Beyond
Building hybrid legal AI systems—including retrieval-augmented pipelines, domain fine-tuning, and human oversight—will be the most productive path forward.
By ds66 · 2024-07-23