陆首群评 DeepSeek 英文版在 Hugging Face 上发表了
“陆主席评DeepSeek”(英文版)在Hugging Face上发表了!
链接:https://huggingface.co/blog/COPU2004/lu-shouquns-view-of-deepseek
以下为英文版全文:
Honorary Chairman of China OSS (Open Source Software) Promotion Union (COPU)
Abstract
DeepSeek can be seen as a representative work of China's current AI, exploring a new path for the development of AI, which may change the development pattern of AI in the world and trigger a fierce competition in global AI. DeepSeek insists on open source innovation. At present, it is also a generative autoregressive language model, which inevitably has negative defects such as limitations and hallucinations of common language models. The author of this article proposes that DeepSeek should make corrections and transition towards the development of AI.
-
DeepSeek’s greatest success is that Liang Wenfeng’s team has developed a new path for developing AI with an innovative attitude: “low investment, low cost, limited resources, high efficiency, and high cost performance (output)”.
-
DeepSeek can be regarded as a representative work of China's current AI and is changing the development pattern of AI in the world. It lowered the bar for the public and enterprises globally to use AI. Open up a smooth road for emerging forces to develop AI. Negating the old path to develop AI with "huge investment, high cost, massive resources, low efficiency, and low cost performance (output)".
-
It is not an exaggeration to call Liang Wenfeng’s team a group of wizards or geniuses who have achieved “national destiny” innovation!
-
Liang Wenfeng's team insists on open source innovation. Open source helps the iterative innovation, stability and upgrade of AI, and the development of the ecosystem. DeepSeek integrates the full open source of the C-end of the large model and the implementation of an open source business model on the B-end. It not only implements open source innovation, but also supports the development of the open source industry. This is also a major creation of DeepSeek.
-
Some people use this to suppress DeepSeek by rating according to the current output product rankings. In fact, the output performance of DeepSeek and other large generative language models is on par with each other, and there is no exaggerated situation of one being higher than the other. If we compare them in a more scientific way based on cost-effectiveness, DeepSeek is definitely the best in the world.
-
Currently, there are not many secrets about DeepSeek’s key technology. Some large generative language models in China and abroad have basically learned DeepSeek’s key technology. When it comes to the next stage of AI competition, it can be said that everyone is on the same starting line.
-
The advent of DeepSeek has triggered a fierce competition in global AI.
-
The current DeepSeek model, like other large language models, are a generative autoregressive large language model. Limitations and negative defects exists in DeepSeek, affecting its performance. In DeepSeek’s development, it is important to overcome limitations, root out defects, greatly improve intelligence, save energy and increase efficiency, and expand applications.
-
For the generative autoregressive language model, since language cannot replace the real world, it lacks world knowledge, or cannot generate new knowledge to truly understand the physical world. In addition, language is not equal to thinking, and it also limits the depth of thinking during operation, which ultimately limits the level of intelligence produced. The autoregressive mechanism of the language model training architecture is based on Tokens and the signal processing and statistics it supports, which is the root cause of the hallucination.
-
DeepSeek, like other standard and inclusive base models, is difficult to directly transform into high-quality productivity for enterprises and industries. It still needs to improve its temporarily missing commercial value. They lack a deep understanding of enterprises and industries. While they are really applied in the business scenarios of enterprises and industries (such as finance, manufacturing, medical care, etc.), to generate value for enterprises and industries, they must capture the data of enterprises and industries and then apply them to fill the gaps.
-
It is suggested that an important task for DeepSeek’s development is to solve its problem of deviation and transition, and strive to win in the fierce global competition.
-
The goal of calibrating DeepSeek is to develop real and advanced AI - Artificial General Intelligence (AGI). When developing AGI, we must avoid being impatient for quick success. To achieve AGI, we must first develop the tasks of AI in the transition stage (such as multimodality, embodiment, agents and world models, etc.). AGI is an AI with an autonomous system. AGI is at a crossroads of whether AI intelligence can surpass humans. This is related to whether it affects human safety and even affects the extremely serious problem of whether humans can survive on the earth. When developing AGI to ensure preventive measures for human safety, it also requires countries around the world to take unified actions on the basis of mutual trust and implement the policy of combining technology and management (regulation). The task is extremely severe and arduous.
中文版:https://www.oschina.net/news/355187/lu-shouquns-view-of-deepseek

低调大师中文资讯倾力打造互联网数据资讯、行业资源、电子商务、移动互联网、网络营销平台。
持续更新报道IT业界、互联网、市场资讯、驱动更新,是最及时权威的产业资讯及硬件资讯报道平台。
转载内容版权归作者及来源网站所有,本站原创内容转载请注明来源。
- 上一篇
陆首群评 DeepSeek
一、DeepSeek获得最大的成功是梁文锋团队以创新的姿态研发出一条“低投资、低成本、有限资源,高效率,高性价比(产出)”发展人工智能的新路径。 二、DeepSeek堪称为当前中国人工智能的代表作,正在改变世界人工智能的发展格局。 *打击美国在人工智能领域的霸权, *为中国发展人工智能开辟了康庄大道, *否定美西方以“巨额投资、高成本、海量资源,低效率,低性价比(产出)”发展人工智能的老路。 三、称梁文锋团队是一群奇才或天才,实现了“国运式”的创新,并不为过! 四、梁文锋团队坚持开源创新,开源有助于人工智能的迭代创新、维稳升级、开拓生态;DeepSeek将大模型C端全开源和B端实行开源商业模式铸成一体,不但有实行开源创新,并可支持开源产业的发展,这也是DeepSeek一大创造。 五、按现行产出产品排行榜排队,有人以此来打压DeepSeek,其实DeepSeek与其他生成式语言大模型的产出性能处于对等状态,不存在过分夸张的你高我低情况,如以更科学的性价比对比,DeepSeek肯定是天下第一。 六、目前DeepSeek的主要技术没有多少秘密可言,国内外一些生成式语言大模型基本上也将Deep...
- 下一篇
AI智能体的技术架构与解决方案
相比于传统软件,AI智能体是一个新兴事物,技术架构和解决方案仍处在高速迭代中。所以,本文章将重点放在理清AI智能体相关的技术脉络,而非具体技术实现。更多相关知识还需要读者通过第三方搜索等方式,保持与时俱进。 一、架构总览 在技术实现的视角看,智能体分为三层:交互层、智能决策层和系统连接层,如图1所示,由智能体执行引擎统一完成编排与调度。 (图1 智能体标准参考架构) 交互层和系统连接层的开发技术与传统的软件开发一致,这里不再赘述。接下来,我们将关注的重点放在智能决策层的核心技术。 一个工程可落地的 AI 智能体智能决策层,并非仅仅是将一个大语言模型封装成接口那么简单。它的背后是一整套针对理解、推理、执行、感知和集成等多个维度的技术体系,包含以下3个核心技术: 智能体运行引擎 外部知识引入 外部能力引入 2.1 智能体运行引擎 智能体运行引擎是AI智能体的核心骨架,可类比于编码开发中的后端框架,如SpringBoot。它负责协调各个功能模块,编排并执行流程,并确保系统的可靠性与可扩展性。一个成熟的智能体引擎不仅仅是简单的调度器,更是一个复杂的状态管理与决策系统,如图2所示。 (图2 智能...
相关文章
文章评论
共有0条评论来说两句吧...
文章二维码
点击排行
推荐阅读
最新文章
- Hadoop3单机部署,实现最简伪集群
- Docker使用Oracle官方镜像安装(12C,18C,19C)
- Jdk安装(Linux,MacOS,Windows),包含三大操作系统的最全安装
- CentOS8安装Docker,最新的服务器搭配容器使用
- CentOS8,CentOS7,CentOS6编译安装Redis5.0.7
- CentOS关闭SELinux安全模块
- CentOS7,CentOS8安装Elasticsearch6.8.6
- Windows10,CentOS7,CentOS8安装Nodejs环境
- CentOS7安装Docker,走上虚拟化容器引擎之路
- Linux系统CentOS6、CentOS7手动修改IP地址