DeepSeek

Recent Articles

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
As AI continues to permeate software engineering, DeepSeek-Coder-V2 stands out as a tool of empowerment — and perhaps, a quiet revolution in itself.
Author: ds66
Date: 2024-11-14
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
If you’re looking for a model that can handle long contexts, perform robust reasoning, and remain economical to run
Author: ds66
Date: 2024-11-13
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
DeepSeek LLM represents one of the most strategically significant open-source AI releases of 2024.
Author: ds66
Date: 2024-11-13
KELPS: A New Era in Autoformalization of Mathematics with Multilingual Precision
KELPS represents a paradigm shift in the field of mathematics autoformalization. By introducing Knowledge Equations and combining symbolic reasoning with LLMs,
Author: ds66
Date: 2024-11-13
Leanabell-Prover-V2: Advancing Formal Theorem Proving with Verifier-Aware Reinforcement Learning
Leanabell-Prover-V2 stands as a landmark achievement in formal theorem proving using LLMs.
Author: ds66
Date: 2024-11-13
A Comprehensive Study of LLM-Based Argument Classification: From LLAMA Through GPT-4o to DeepSeek-R1
By identifying strengths, weaknesses, and opportunities, this paper lays the groundwork for better integration of LLMs in education,
Author: ds66
Date: 2024-11-13
Agentic Large Language Models for Conceptual Systems Engineering and Design
This comprehensive evaluation of agentic LLM systems in conceptual systems engineering design reveals both exciting capabilities and sobering limitations.
Author: ds66
Date: 2024-11-12
KAT-V1: Pioneering Reasoning Control in Large Language Models with AutoThink
KAT-V1 is a milestone in the evolution of large language models. By introducing the AutoThink paradigm and solving the overthinking problem,
Author: ds66
Date: 2024-11-12
DrugMCTS: Reinventing Drug Repurposing Through AI, Multi-Agent Collaboration, and Monte Carlo Tree Search
DrugMCTS stands at the forefront of intelligent, reasoning-capable biomedical AI. By blending multi-agent systems, RAG, and Monte Carlo Tree Search, it introduces a new paradigm in drug repurposing.
Author: ds66
Date: 2024-11-12
POV: You're the 10x Developer at DeepSeek
In this POV, you’re not just building DeepSeek—you are DeepSeek.
Author: ds66
Date: 2024-11-12
Phishing Detection in the Gen‑AI Era: Quantized LLMs vs Classical Models
While classical ML/DL models lead in raw detection accuracy, quantized LLMs offer distinct advantages, particularly in semantic understanding,
Author: ds66
Date: 2024-11-10
CCQ: A New Frontier in Extreme Low‑bit Quantization for LLMs
CCQ is a milestone in LLM quantization—achieving ultra-low-bit performance with minimal accuracy loss and inference overhead.
Author: ds66
Date: 2024-11-10