Petter Reinholdtsen: Idea for finding all public domain movies in the USA

Idea for finding all public domain movies in the USA

13th December 2017

While looking at +the scanned copies +for the copyright renewal entries for movies published in the USA, +an idea occurred to me. The number of renewals are so few per year, it +should be fairly quick to transcribe them all and add references to +the corresponding IMDB title ID. This would give the (presumably) +complete list of movies published 28 years earlier that did _not_ +enter the public domain for the transcribed year. By fetching the +list of USA movies published 28 years earlier and subtract the movies +with renewals, we should be left with movies registered in IMDB that +are now in the public domain. For the year 1955 (which is the one I +have looked at the most), the total number of pages to transcribe is +21. For the 28 years from 1950 to 1978, it should be in the range +500-600 pages. It is just a few days of work, and spread among a +small group of people it should be doable in a few weeks of spare +time.

+ +

A typical copyright renewal entry look like this (the first one +listed for 1955):

+ +

+ ADAM AND EVIL, a photoplay in seven reels by Metro-Goldwyn-Mayer + Distribution Corp. (c) 17Aug27; L24293. Loew's Incorporated (PWH); + 10Jun55; R151558. +

+ +

The movie title as well as registration and renewal dates are easy +enough to locate by a program (split on first comma and look for +DDmmmYY). The rest of the text is not required to find the movie in +IMDB, but is useful to confirm the correct movie is found. I am not +quite sure what the L and R numbers mean, but suspect they are +reference numbers into the archive of the US Copyright Office.

+ +

Tracking down the equivalent IMDB title ID is probably going to be +a manual task, but given the year it is fairly easy to search for the +movie title using for example +http://www.imdb.com/find?q=adam+and+evil+1927&s=all +Using this search, I find that the equivalent IMDB title ID for the +first renewal entry from 1955 is +http://www.imdb.com/title/tt0017588/.

+ +

I suspect the best way to do this would be to make a specialised +web service to make it easy for contributors to transcribe and track +down IMDB title IDs. In the web service, once a entry is transcribed, +the title and year could be extracted from the text, a search in IMDB +conducted for the user to pick the equivalent IMDB title ID right +away. By spreading out the work among volunteers, it would also be +possible to make at least two persons transcribe the same entries to +be able to discover any typos introduced. But I will need help to +make this happen, as I lack the spare time to do all of this on my +own. If you would like to help, please get in touch. Perhaps you can +draft a web service for crowd sourcing the task?

+ +

Note, Project Gutenberg already have some +transcribed +copies of the US Copyright Office renewal protocols, but I have +not been able to find any film renewals there, so I suspect they only +have copies of renewal for written works. I have not been able to find +any transcribed versions of movie renewals so far. Perhaps they exist +somewhere?

+ +

I would love to figure out methods for finding all the public +domain works in other countries too, but it is a lot harder. At least +for Norway and Great Britain, such work involve tracking down the +people involved in making the movie and figuring out when they died. +It is hard enough to figure out who was part of making a movie, but I +do not know how to automate such procedure without a registry of every +person involved in making movies and their death year.

+ +

As usual, if you use Bitcoin and want to show your support of my +activities, please send Bitcoin donations to my address +15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b.

+ +

Tags: english, opphavsrett, verkidetfri.

+ + +

The movie title as well as registration and renewal dates are easy -enough to locate by a program (split on first command and look for +enough to locate by a program (split on first comma and look for DDmmmYY). The rest of the text is not required to find the movie in IMDB, but is useful to confirm the correct movie is found. I am not quite sure what the L and R numbers mean, but suspect they are diff --git a/blog/index.rss b/blog/index.rss index 850632b3b0..958bb0c2ec 100644 --- a/blog/index.rss +++ b/blog/index.rss @@ -38,7 +38,7 @@ listed for 1955): </blockquote> The movie title as well as registration and renewal dates are easy -enough to locate by a program (split on first command and look for +enough to locate by a program (split on first comma and look for DDmmmYY). The rest of the text is not required to find the movie in IMDB, but is useful to confirm the correct movie is found. I am not quite sure what the L and R numbers mean, but suspect they are diff --git a/blog/tags/english/english.rss b/blog/tags/english/english.rss index 6f52943177..0bdb5b1752 100644 --- a/blog/tags/english/english.rss +++ b/blog/tags/english/english.rss @@ -38,7 +38,7 @@ listed for 1955): </blockquote> The movie title as well as registration and renewal dates are easy -enough to locate by a program (split on first command and look for +enough to locate by a program (split on first comma and look for DDmmmYY). The rest of the text is not required to find the movie in IMDB, but is useful to confirm the correct movie is found. I am not quite sure what the L and R numbers mean, but suspect they are diff --git a/blog/tags/english/index.html b/blog/tags/english/index.html index c6be0312cf..f91eac824c 100644 --- a/blog/tags/english/index.html +++ b/blog/tags/english/index.html @@ -55,7 +55,7 @@ listed for 1955):

The movie title as well as registration and renewal dates are easy -enough to locate by a program (split on first command and look for +enough to locate by a program (split on first comma and look for DDmmmYY). The rest of the text is not required to find the movie in IMDB, but is useful to confirm the correct movie is found. I am not quite sure what the L and R numbers mean, but suspect they are diff --git a/blog/tags/opphavsrett/opphavsrett.rss b/blog/tags/opphavsrett/opphavsrett.rss index c7560d905a..bb69d5138a 100644 --- a/blog/tags/opphavsrett/opphavsrett.rss +++ b/blog/tags/opphavsrett/opphavsrett.rss @@ -38,7 +38,7 @@ listed for 1955): </blockquote> The movie title as well as registration and renewal dates are easy -enough to locate by a program (split on first command and look for +enough to locate by a program (split on first comma and look for DDmmmYY). The rest of the text is not required to find the movie in IMDB, but is useful to confirm the correct movie is found. I am not quite sure what the L and R numbers mean, but suspect they are diff --git a/blog/tags/verkidetfri/index.html b/blog/tags/verkidetfri/index.html index b47be957e2..df6cf8a3a8 100644 --- a/blog/tags/verkidetfri/index.html +++ b/blog/tags/verkidetfri/index.html @@ -55,7 +55,7 @@ listed for 1955):

+ Petter Reinholdtsen + +

Archive

Tags