从Claude 3.5 Sonnet看AI大模型的进展

前言

最近使用了一段时间的claude3.5 sonnet，对比上一次使用大语言模型AI，深深地感觉到AI在这一年具体在code上面的应用提升特别明显，下图是官方的介绍图。

可以看到除了graduate level reasoning和math problem-solving以外，其他的准确率都达到了90%以上。

具体实践

本次我以一篇如何在ubuntu22.04上搭建wordpress为例子，一个人从搜索引擎搜索搭建写博客，与用合适的提示语让claude生成为例做对比，具体内容如下：

人工队：

链接https://phyiscs.com/build-wordpress-on-ubuntu-22-04.html

AI队：

链接https://claude.site/artifacts/e316c633-1f3a-4e56-8f0d-ed36b96c462e

裁判：

个人来看AI提供的答案总体上更完备，解释更多，更多的安全配置，并且大致思路和总体内容都是一致的。

然后我让AI来评分这两个答案，他给出了这样的评价

我写的内容

Clarity: 7/10
Technical Accuracy: 8/10
Completeness: 6/10
Practical Usage: 7/10
Security Considerations: 6/10
Overall: 6.8/10

他给的建议

Clarity: 9/10
Technical Accuracy: 9/10
Completeness: 9/10
Practical Usage: 8/10
Security Considerations: 9/10
Overall: 8.8/10

在这种特定的开发任务上，AI“自认为”全方面领先。

后续感想

掌握AI工具，不分文科，工科，理科，落后就是被淘汰。

对于个人的作用

吾生有涯而学无涯，自上世纪初以来，世界上几乎再难有多个学科领域跨界的全才。大部分人都专精于自己某一个领域，而对隔壁领域或者跨界的领域内容知之甚少。AI则正好弥补了这一部分，大大加速人类的学习工作效率。

在code领域尤其明显。以前AI写程序，基本上不太可用，都是一本正经地胡说八道，现在写出来的程序基本可用了。而熟练使用AI能使得编程效率从小时进化到秒。

同时AI将会进一步占用搜索引擎的市场份额，十几年一样，会使用搜索引擎学习的人和不会使用搜索引擎的人能力的差别迥异，到如今会使用AI的人和不会使用AI的人将会更加的云泥之别。

对sunwaybits博客内容影响

博客方向可能更加数理方向和冷门与专业的内容，AI不能完整回答的内容。至于AI能给出完整内容的，重心可能不再放在那些简单的回答上。

新闻链接甚至现在能直接操控你的电脑帮你写代码了，发展速度实在是太恐怖。资本的效率真的是可怕。

AI自己的感想

AI’s Role in Software Development:

Current State:
AI assists with code completion, suggestions, and basic debugging
Helps generate boilerplate code and documentation
Can convert natural language to simple code snippets
Aids in code review and identifying potential bugs

Near Future (1-3 years):
More sophisticated code generation for complex functions
Better understanding of system architecture and design patterns
Improved bug prediction and automated fixes
Enhanced code optimization suggestions
More accurate natural language to code conversion

Long-term Future (3-10 years):
Potential automation of routine development tasks
AI-driven system design and architecture recommendations
Autonomous debugging and optimization
Real-time code adaptation based on system performance
Collaborative AI-human development environments

前言