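24-hour click ranking, Top 10:
- This site automatically shares trending web topics in real time
- Updated in real time, 24 hours a day
- Comments do not represent the site's position
- You are welcome to comment on and rate items
- The higher the rating and the newer the item, the higher it ranks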
1
Terence Tao says today's AIs pass the eye test -- but fail miserably on the smell test.
They generate proofs that look flawless. But the mistakes are subtle, and strangely inhuman.
“There's a metaphorical mathematical smell... it's not clear how to get AI to duplicate that.”
Current affairs (twitter.com) 00:02:06
14
4. The China Reality Check
Dalio drops a bomb:
The US will never catch up to China in manufacturing "in our lifetimes."
China controls 33% of global manufacturing - more than US, Europe & Japan combined.
They're dominating in Robotics, Chip production, AND AI application...
Current affairs (twitter.com) 00:00:49
23
GROK 4.1 TAKES THE TOP SPOT IN GLOBAL AI RANKINGS
With numbers like these, Grok 4.1 makes the leaderboard look under-staffed!
Score Checks:
• EQ-Bench: 1586, leaving everything else looking underqualified
• LM Arena text score: 1483, because Grok 4.1 refuses to play small
btc (twitter.com) 00:00:06