]> pere.pagekite.me Git - homepage.git/blob - blog/data/2017-11-18-verk-i-det-fri-filmer.txt
Generated.
[homepage.git] / blog / data / 2017-11-18-verk-i-det-fri-filmer.txt
1 Title: Legal to share more than 3000 movies listed on IMDB?
2 Tags: english, opphavsrett, verkidetfri
3 Date: 2017-11-18 21:20
4
5 <p>A month ago, I blogged about my work to
6 <a href="http://people.skolelinux.org/pere/blog/Locating_IMDB_IDs_of_movies_in_the_Internet_Archive_using_Wikidata.html">automatically
7 check the copyright status of IMDB entries</a>, and try to count the
8 number of movies listed in IMDB that is legal to distribute on the
9 Internet. I have continued to look for good data sources, and
10 identified a few more. The code used to extract information from
11 various data sources is available in
12 <a href="https://github.com/petterreinholdtsen/public-domain-free-imdb">a
13 git repository</a>, currently available from github.</p>
14
15 <p>So far I have identified 3186 unique IMDB title IDs. To gain
16 better understanding of the structure of the data set, I created a
17 histogram of the year associated with each movie (typically release
18 year). It is interesting to notice where the peaks and dips in the
19 graph are located. I wonder why they are placed there. I suspect
20 World War II caused the dip around 1940, but what caused the peak
21 around 2010?</p>
22
23 <p align="center"><img src="http://people.skolelinux.org/pere/blog/images/2017-11-18-verk-i-det-fri-filmer.png" /></p>
24
25 <p>I've so far identified ten sources for IMDB title IDs for movies in
26 the public domain or with a free license. This is the statistics
27 reported when running 'make stats' in the git repository:</p>
28
29 <pre>
30 249 entries ( 6 unique) with and 288 without IMDB title ID in free-movies-archive-org-butter.json
31 2301 entries ( 540 unique) with and 0 without IMDB title ID in free-movies-archive-org-wikidata.json
32 830 entries ( 29 unique) with and 0 without IMDB title ID in free-movies-icheckmovies-archive-mochard.json
33 2109 entries ( 377 unique) with and 0 without IMDB title ID in free-movies-imdb-pd.json
34 291 entries ( 122 unique) with and 0 without IMDB title ID in free-movies-letterboxd-pd.json
35 144 entries ( 135 unique) with and 0 without IMDB title ID in free-movies-manual.json
36 350 entries ( 1 unique) with and 801 without IMDB title ID in free-movies-publicdomainmovies.json
37 4 entries ( 0 unique) with and 124 without IMDB title ID in free-movies-publicdomainreview.json
38 698 entries ( 119 unique) with and 118 without IMDB title ID in free-movies-publicdomaintorrents.json
39 8 entries ( 8 unique) with and 196 without IMDB title ID in free-movies-vodo.json
40 3186 unique IMDB title IDs in total
41 </pre>
42
43 <p>The entries without IMDB title ID are candidates to increase the
44 data set, but might equally well be duplicates of entries already
45 listed with IMDB title ID in one of the other sources, or represent
46 movies that lack a IMDB title ID. I've seen examples of all these
47 situations when peeking at the entries without IMDB title ID. Based
48 on these data sources, the lower bound for movies listed in IMDB that
49 are legal to distribute on the Internet is between 3186 and 4713.
50
51 <p>It would be great for improving the accuracy of this measurement,
52 if the various sources added IMDB title ID to their metadata. I have
53 tried to reach the people behind the various sources to ask if they
54 are interested in doing this, without any replies so far. Perhaps you
55 can help me get in touch with the people behind VODO, Public Domain
56 Torrents, Public Domain Movies and Public Domain Review to try to
57 convince them to add more metadata to their movie entries?</p>
58
59 <p>Another way you could help is by adding pages to Wikipedia about
60 movies that are legal to distribute on the Internet. If such page
61 exist and include a link to both IMDB and The Internet Archive, the
62 script used to generate free-movies-archive-org-wikidata.json should
63 pick up the mapping as soon as wikidata is updates.</p>
64
65 <p>As usual, if you use Bitcoin and want to show your support of my
66 activities, please send Bitcoin donations to my address
67 <b><a href="bitcoin:15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b">15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b</a></b>.</p>