1
00:00:01,920 --> 00:00:05,320
Hey everybody, welcome back to 
the Elon Musk Podcast. 

2
00:00:05,360 --> 00:00:08,640
This is a show where we discuss 
the critical Crossroads, The 

3
00:00:08,640 --> 00:00:13,520
Shape, SpaceX, Tesla X, The 
Boring Company, and Neuralink, 

4
00:00:13,680 --> 00:00:17,720
and I'm your host, Will Walden. 
The New York Times has filed A 

5
00:00:17,720 --> 00:00:21,120
lawsuit against Open AI and 
Microsoft, alleging the 

6
00:00:21,160 --> 00:00:24,120
unauthorized use to millions of 
its articles to train and 

7
00:00:24,120 --> 00:00:27,280
operate ChatGPT. 
Now this legal action is the 

8
00:00:27,280 --> 00:00:30,880
most recent among several filed 
by creators and publishers 

9
00:00:30,880 --> 00:00:34,600
including Sarah Silverman and 
author George RR Martin amongst 

10
00:00:34,600 --> 00:00:37,000
others. 
This is against tech companies 

11
00:00:37,000 --> 00:00:40,920
for using their work to develop 
large language AI models without

12
00:00:40,920 --> 00:00:43,720
their permission. 
A central to these lawsuits is 

13
00:00:43,720 --> 00:00:47,240
the practice of scraping, which 
involves collecting vast amounts

14
00:00:47,240 --> 00:00:50,960
of Internet data to train AI 
models like ChatGPT. 

15
00:00:51,520 --> 00:00:55,040
Web crawlers designed to index 
and download web content are 

16
00:00:55,040 --> 00:00:59,040
increasingly feeding AI models, 
raising concerns among creative 

17
00:00:59,040 --> 00:01:02,520
content creators about copyright
infringement and fair 

18
00:01:02,520 --> 00:01:05,680
compensation. 
The New York Times claims its 

19
00:01:05,680 --> 00:01:09,480
content was significantly used 
in the Common Crawl data set, 

20
00:01:09,680 --> 00:01:13,640
which Open AII has admitted to 
using for training earlier 

21
00:01:13,640 --> 00:01:17,360
versions of Chan GBT. 
But legal experts are divided on

22
00:01:17,360 --> 00:01:21,080
whether using Internet data 
falls under fair use. 

23
00:01:21,520 --> 00:01:24,640
That's the commercial use is a 
key in consideration. 

24
00:01:25,200 --> 00:01:28,000
Commercial use is a key 
consideration in determining 

25
00:01:28,000 --> 00:01:30,680
fair use. 
Now, many AI companies, 

26
00:01:30,680 --> 00:01:35,080
initially nonprofits eventually 
develop profitable products like

27
00:01:35,080 --> 00:01:39,280
Open AI websites, have started 
blocking web crawlers to protect

28
00:01:39,280 --> 00:01:41,520
their content. 
Now there's two methods to do 

29
00:01:41,520 --> 00:01:43,600
this. 
One's based on mutual respect 

30
00:01:43,840 --> 00:01:48,000
and another uses technology to 
identify and block bad behavior.

31
00:01:48,000 --> 00:01:51,560
Bots that differ from human 
users and the reduction in 

32
00:01:51,560 --> 00:01:55,120
accessible data for web crawlers
could benefit content creators 

33
00:01:55,120 --> 00:02:00,080
but might also hinder other 
users like researchers In the 

34
00:02:00,080 --> 00:02:04,200
past, web scrapers were used to 
collect data about competitors 

35
00:02:04,440 --> 00:02:06,600
and some people use them still 
for that. 

36
00:02:06,760 --> 00:02:11,000
But also you can get tracking 
and privacy data from these 

37
00:02:11,000 --> 00:02:13,760
trackers. 
And now there's an increased 

38
00:02:13,760 --> 00:02:16,760
reliance on web crawling for 
archiving digital content. 

39
00:02:17,040 --> 00:02:20,120
This modern technique captures 
online primary sources, 

40
00:02:20,480 --> 00:02:24,480
preserving them as historical 
records, and major publishers 

41
00:02:24,480 --> 00:02:27,480
have engaged in discussions with
Open AI Now about licensing 

42
00:02:27,480 --> 00:02:30,560
content for AI training. 
However, reaching agreement on 

43
00:02:30,560 --> 00:02:33,440
pricing and terms has been 
challenging, indicating a 

44
00:02:33,440 --> 00:02:37,160
complex negotiating landscape, 
and confidential talks have been

45
00:02:37,160 --> 00:02:40,400
ongoing between top US media 
companies and Open AI recently. 

46
00:02:40,680 --> 00:02:44,160
Organizations like Ghana News 
Corp and IAC have been part of 

47
00:02:44,160 --> 00:02:47,120
these discussions, according to 
sources very familiar with these

48
00:02:47,120 --> 00:02:50,080
negotiations. 
Now, Microsoft, who's a huge 

49
00:02:50,080 --> 00:02:52,760
investor in Open AI with 
millions of dollars invested, 

50
00:02:53,160 --> 00:02:56,720
has also participated in these 
talks, and the talks have been 

51
00:02:56,720 --> 00:02:59,600
complicated by the rapid 
development of AI applications, 

52
00:02:59,800 --> 00:03:02,120
raising important questions 
about the future of the media 

53
00:03:02,120 --> 00:03:04,800
industry. 
Open AI has expressed respect 

54
00:03:04,800 --> 00:03:07,360
for content creators, rights, 
and the need for mutually 

55
00:03:07,360 --> 00:03:10,240
beneficial collaborations, as 
indicated in their deals with 

56
00:03:10,240 --> 00:03:12,400
The Associated Press and Axel 
Springer. 

57
00:03:13,320 --> 00:03:15,880
The media industry, having 
previously lost significant 

58
00:03:15,880 --> 00:03:18,560
advertising revenue to tech 
giants, is cautious about 

59
00:03:18,640 --> 00:03:21,440
undervaluing their content in 
deals with AI companies. 

60
00:03:21,760 --> 00:03:24,320
There's a concern about AI 
applications potentially 

61
00:03:24,320 --> 00:03:27,760
spreading misinformation by 
inaccurately citing articles. 

62
00:03:28,240 --> 00:03:31,240
Some news organizations have 
successfully negotiated deals 

63
00:03:31,240 --> 00:03:33,880
with Open AI, like The 
Associated Press and Axel 

64
00:03:33,880 --> 00:03:35,760
Springer. 
Like I said before, however, 

65
00:03:35,760 --> 00:03:38,800
companies like Bloomberg and the
Washington Post have opted to 

66
00:03:38,800 --> 00:03:42,280
focus on their own AI strategies
instead of collaborating with 

67
00:03:42,280 --> 00:03:44,360
Open AI Now. 
Despite these tensions, though 

68
00:03:44,640 --> 00:03:47,040
some industry executives 
acknowledge the potential 

69
00:03:47,040 --> 00:03:50,680
benefits of AI for journalism, 
the mutual dependency between 

70
00:03:50,680 --> 00:03:54,720
news organizations and AI firms 
shows that the need for a 

71
00:03:54,720 --> 00:03:58,400
balance and swift resolution for
these disputes is needed. 

72
00:03:58,800 --> 00:04:01,360
The lawsuit underscores the 
growing tension between the 

73
00:04:01,360 --> 00:04:04,840
media industry and AI tech as 
well, potentially reshaping the 

74
00:04:04,840 --> 00:04:07,600
news landscape. 
And Microsoft and Open AI are 

75
00:04:07,600 --> 00:04:11,040
accused of using copyright 
content to train AI services 

76
00:04:11,040 --> 00:04:15,960
like ChatGPT allegedly causing 
significant financial damages. 

77
00:04:16,320 --> 00:04:19,079
Microsoft and Open AI have been 
silent in response to the 

78
00:04:19,079 --> 00:04:21,240
lawsuit. 
The case represents a major 

79
00:04:21,240 --> 00:04:24,080
challenge to Open AI's practice 
of scraping web content. 

80
00:04:24,400 --> 00:04:28,600
This is the common practice for 
ChatGPT since its debut, and the

81
00:04:28,600 --> 00:04:31,680
company has attempted to secure 
licensing deals with publishers 

82
00:04:31,960 --> 00:04:35,360
to address all these issues. 
And now Open AI faces multiple 

83
00:04:35,360 --> 00:04:38,560
lawsuits from various content 
producers highlighting this 

84
00:04:38,560 --> 00:04:41,480
complex legal terrain. 
That's surrounding AI and 

85
00:04:41,480 --> 00:04:44,560
copyright right now, and the 
outcome of these cases could set

86
00:04:44,560 --> 00:04:49,120
an important precedent for large
language models and its 

87
00:04:49,120 --> 00:04:50,960
interaction with content 
creators. 

88
00:04:51,480 --> 00:04:54,520
And Microsoft is Open AI's 
largest supporter. 

89
00:04:55,000 --> 00:04:58,520
It's integrated the startups AI 
tools into its products, and the

90
00:04:58,520 --> 00:05:01,320
lawsuit alleges that Microsoft's
use of the New York Times 

91
00:05:01,320 --> 00:05:04,720
content has significantly 
boosted its market value. 

92
00:05:05,320 --> 00:05:07,760
Now, the New York Times 
spokesperson also emphasized the

93
00:05:07,760 --> 00:05:11,200
legal requirement for obtaining 
permission before using their 

94
00:05:11,200 --> 00:05:14,400
work for commercial purposes, A 
requirement they allege 

95
00:05:14,400 --> 00:05:17,280
Microsoft and Open AI have not 
met. 

96
00:05:17,640 --> 00:05:20,560
And the resolution of this case 
could have significant 

97
00:05:20,560 --> 00:05:24,480
implications for the future of 
AI in relation to copyrighted 

98
00:05:24,480 --> 00:05:27,120
content. 
Hey, thank you so much for 

99
00:05:27,120 --> 00:05:29,320
listening today. 
I really do appreciate your 

100
00:05:29,320 --> 00:05:30,840
support. 
If you could take a second and 

101
00:05:30,840 --> 00:05:33,760
hit the subscribe or the follow 
button on whatever podcast 

102
00:05:33,760 --> 00:05:36,760
platform that you're listening 
on right now, I greatly 

103
00:05:36,760 --> 00:05:38,400
appreciate it. 
It helps out the show 

104
00:05:38,400 --> 00:05:41,640
tremendously and you'll never 
miss an episode, and each 

105
00:05:41,640 --> 00:05:45,120
episode is about 10 minutes or 
less to get you caught up 

106
00:05:45,120 --> 00:05:47,360
quickly. 
And please, if you want to 

107
00:05:47,360 --> 00:05:53,520
support the show even more, go 
to patreon.com/stage Zero and 

108
00:05:53,520 --> 00:05:55,120
please take care of yourselves 
and each other. 

109
00:05:55,480 --> 00:05:56,440
I'll see you tomorrow.