<link>http://people.skolelinux.org/pere/blog/</link>
<atom:link href="http://people.skolelinux.org/pere/blog/index.rss" rel="self" type="application/rss+xml" />
+ <item>
+ <title>Legal to share more than 16,000 movies listed on IMDB?</title>
+ <link>http://people.skolelinux.org/pere/blog/Legal_to_share_more_than_16_000_movies_listed_on_IMDB_.html</link>
+ <guid isPermaLink="true">http://people.skolelinux.org/pere/blog/Legal_to_share_more_than_16_000_movies_listed_on_IMDB_.html</guid>
+ <pubDate>Sat, 10 Aug 2019 12:00:00 +0200</pubDate>
+ <description><p>The recent announcement of from the New York Public Library on its
+results in
+<a href="https://www.vice.com/en_us/article/kz4e3e/millions-of-books-are-secretly-in-the-public-domain-you-can-download-them-free">identifying
+books published in the USA that are now in the public domain</a>,
+inspired me to update the scripts I use to track down movies that are
+in the public domain. This involved updating the script used to
+extract lists of movies believed to be in the public domain, to work
+with the latest version of the source web sites. In particular the
+new edition of the <a href="https://retrofilmvault.com/">Retro Film
+Vault</a> web site now seem to list all the films available from that
+distributor, bringing the films identified there to more than 12.000
+movies, and I was able to connect 46% of these to IMDB titles.</p>
+
+<p>The new total is 16307 IMDB IDs (aka films) in the public domain or
+creative commons licensed, and unknown status for 31460 movies
+(possibly duplicates of the 16307).</p>
+
+<p>The complete data set is available from
+<a href="https://github.com/petterreinholdtsen/public-domain-free-imdb">a
+public git repository</a>, including the scripts used to create it.</p>
+
+<p>Anyway, this is the summary of the 28 collected data sources so
+far:</p>
+
+<p><pre>
+ 2361 entries ( 50 unique) with and 22472 without IMDB title ID in free-movies-archive-org-search.json
+ 2363 entries ( 146 unique) with and 0 without IMDB title ID in free-movies-archive-org-wikidata.json
+ 299 entries ( 32 unique) with and 93 without IMDB title ID in free-movies-cinemovies.json
+ 88 entries ( 52 unique) with and 36 without IMDB title ID in free-movies-creative-commons.json
+ 3190 entries ( 1532 unique) with and 13 without IMDB title ID in free-movies-fesfilm-xls.json
+ 620 entries ( 24 unique) with and 283 without IMDB title ID in free-movies-fesfilm.json
+ 1080 entries ( 165 unique) with and 651 without IMDB title ID in free-movies-filmchest-com.json
+ 830 entries ( 13 unique) with and 0 without IMDB title ID in free-movies-icheckmovies-archive-mochard.json
+ 19 entries ( 19 unique) with and 0 without IMDB title ID in free-movies-imdb-c-expired-gb.json
+ 7410 entries ( 7101 unique) with and 0 without IMDB title ID in free-movies-imdb-c-expired-us.json
+ 1205 entries ( 41 unique) with and 0 without IMDB title ID in free-movies-imdb-pd.json
+ 163 entries ( 22 unique) with and 88 without IMDB title ID in free-movies-infodigi-pd.json
+ 158 entries ( 103 unique) with and 0 without IMDB title ID in free-movies-letterboxd-looney-tunes.json
+ 113 entries ( 4 unique) with and 0 without IMDB title ID in free-movies-letterboxd-pd.json
+ 182 entries ( 71 unique) with and 0 without IMDB title ID in free-movies-letterboxd-silent.json
+ 248 entries ( 85 unique) with and 0 without IMDB title ID in free-movies-manual.json
+ 158 entries ( 4 unique) with and 64 without IMDB title ID in free-movies-mubi.json
+ 85 entries ( 1 unique) with and 23 without IMDB title ID in free-movies-openflix.json
+ 520 entries ( 22 unique) with and 244 without IMDB title ID in free-movies-profilms-pd.json
+ 343 entries ( 14 unique) with and 10 without IMDB title ID in free-movies-publicdomainmovies-info.json
+ 701 entries ( 16 unique) with and 560 without IMDB title ID in free-movies-publicdomainmovies-net.json
+ 74 entries ( 13 unique) with and 60 without IMDB title ID in free-movies-publicdomainreview.json
+ 698 entries ( 16 unique) with and 118 without IMDB title ID in free-movies-publicdomaintorrents.json
+ 5506 entries ( 2941 unique) with and 6585 without IMDB title ID in free-movies-retrofilmvault.json
+ 16 entries ( 0 unique) with and 0 without IMDB title ID in free-movies-thehillproductions.json
+ 110 entries ( 2 unique) with and 29 without IMDB title ID in free-movies-two-movies-net.json
+ 73 entries ( 20 unique) with and 131 without IMDB title ID in free-movies-vodo.json
+16307 unique IMDB title IDs in total, 12509 only in one list, 31460 without IMDB title ID
+</pre></p>
+
+<p>New this time is a list of all the identified IMDB titles, with
+title, year and running time, provided in free-complete.json. this
+file also indiciate which source is used to conclude the video is free
+to distribute.</p>
+
+<p>As usual, if you use Bitcoin and want to show your support of my
+activities, please send Bitcoin donations to my address
+<b><a href="bitcoin:15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b">15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b</a></b>.</p>
+</description>
+ </item>
+
<item>
<title>Oppdatert Noark 5 Tjenestegrenesnitt versjon 1.0 for Noark 5.5.0</title>
<link>http://people.skolelinux.org/pere/blog/Oppdatert_Noark_5_Tjenestegrenesnitt_versjon_1_0_for_Noark_5_5_0.html</link>
"application/octet-stream" MIME type, which is just a nice way of
stating "I do not know". Soon, we will know. :)</p>
-<p>As usual, if you use Bitcoin and want to show your support of my
-activities, please send Bitcoin donations to my address
-<b><a href="bitcoin:15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b">15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b</a></b>.</p>
-</description>
- </item>
-
- <item>
- <title>PlantUML for text based UML diagram modelling - nice free software</title>
- <link>http://people.skolelinux.org/pere/blog/PlantUML_for_text_based_UML_diagram_modelling___nice_free_software.html</link>
- <guid isPermaLink="true">http://people.skolelinux.org/pere/blog/PlantUML_for_text_based_UML_diagram_modelling___nice_free_software.html</guid>
- <pubDate>Mon, 25 Mar 2019 09:35:00 +0100</pubDate>
- <description><p>As part of my involvement with the
-<a href="https://gitlab.com/OsloMet-ABI/nikita-noark5-core/">Nikita
-Noark 5 core project</a>, I have been proposing improvements to the
-API specification created by <a href="https://www.arkivverket.no/">The
-National Archives of Norway</a> and helped migrating the text from a
-version control system unfriendly binary format (docx) to Markdown in
-git. Combined with the migration to a public git repository (on
-github), this has made it possible for anyone to suggest improvement
-to the text.</p>
-
-<p>The specification is filled with UML diagrams. I believe the
-original diagrams were modelled using Sparx Systems Enterprise
-Architect, and exported as EMF files for import into docx. This
-approach make it very hard to track changes using a version control
-system. To improve the situation I have been looking for a good text
-based UML format with associated command line free software tools on
-Linux and Windows, to allow anyone to send in corrections to the UML
-diagrams in the specification. The tool must be text based to work
-with git, and command line to be able to run it automatically to
-generate the diagram images. Finally, it must be free software to
-allow anyone, even those that can not accept a non-free software
-license, to contribute.</p>
-
-<p>I did not know much about free software UML modelling tools when I
-started. I have used dia and inkscape for simple modelling in the
-past, but neither are available on Windows, as far as I could tell. I
-came across a nice
-<a href="https://modeling-languages.com/text-uml-tools-complete-list/">list
-of text mode uml tools</a>, and tested out a few of the tools listed
-there. <a href="http://plantuml.com/">The PlantUML tool</a> seemed
-most promising. After verifying that the packages
-<a href="https://tracker.debian.org/pkg/plantuml">is available in
-Debian</a> and found <a href="https://github.com/plantuml/plantuml">its
-Java source</a> under a GPL license on github, I set out to test if it
-could represent the diagrams we needed, ie the ones currently in
-<a href="https://github.com/arkivverket/noark5-tjenestegrensesnitt-standard/">the
-Noark 5 Tjenestegrensesnitt specification</a>. I am happy to report
-that it could represent them, even thought it have a few warts here
-and there.</p>
-
-<p>After a few days of modelling I completed the task this weekend. A
-temporary link to the complete set of diagrams (original and from
-PlantUML) is available in
-<a href="https://github.com/arkivverket/noark5-tjenestegrensesnitt-standard/issues/76">the
-github issue discussing the need for a text based UML format</a>, but
-please note I lack a sensible tool to convert EMF files to PNGs, so
-the "original" rendering is not as good as the original was in the
-publised PDF.</p>
-
-<p>Here is an example UML diagram, showing the core classes for
-keeping metadata about archived documents:</p>
-
-<pre>
-@startuml
-skinparam classAttributeIconSize 0
-
-!include media/uml-class-arkivskaper.iuml
-!include media/uml-class-arkiv.iuml
-!include media/uml-class-klassifikasjonssystem.iuml
-!include media/uml-class-klasse.iuml
-!include media/uml-class-arkivdel.iuml
-!include media/uml-class-mappe.iuml
-!include media/uml-class-merknad.iuml
-!include media/uml-class-registrering.iuml
-!include media/uml-class-basisregistrering.iuml
-!include media/uml-class-dokumentbeskrivelse.iuml
-!include media/uml-class-dokumentobjekt.iuml
-!include media/uml-class-konvertering.iuml
-!include media/uml-datatype-elektronisksignatur.iuml
-
-Arkivstruktur.Arkivskaper "+arkivskaper 1..*" <-o "+arkiv 0..*" Arkivstruktur.Arkiv
-Arkivstruktur.Arkiv o--> "+underarkiv 0..*" Arkivstruktur.Arkiv
-Arkivstruktur.Arkiv "+arkiv 1" o--> "+arkivdel 0..*" Arkivstruktur.Arkivdel
-Arkivstruktur.Klassifikasjonssystem "+klassifikasjonssystem [0..1]" <--o "+arkivdel 1..*" Arkivstruktur.Arkivdel
-Arkivstruktur.Klassifikasjonssystem "+klassifikasjonssystem [0..1]" o--> "+klasse 0..*" Arkivstruktur.Klasse
-Arkivstruktur.Arkivdel "+arkivdel 0..1" o--> "+mappe 0..*" Arkivstruktur.Mappe
-Arkivstruktur.Arkivdel "+arkivdel 0..1" o--> "+registrering 0..*" Arkivstruktur.Registrering
-Arkivstruktur.Klasse "+klasse 0..1" o--> "+mappe 0..*" Arkivstruktur.Mappe
-Arkivstruktur.Klasse "+klasse 0..1" o--> "+registrering 0..*" Arkivstruktur.Registrering
-Arkivstruktur.Mappe --> "+undermappe 0..*" Arkivstruktur.Mappe
-Arkivstruktur.Mappe "+mappe 0..1" o--> "+registrering 0..*" Arkivstruktur.Registrering
-Arkivstruktur.Merknad "+merknad 0..*" <--* Arkivstruktur.Mappe
-Arkivstruktur.Merknad "+merknad 0..*" <--* Arkivstruktur.Dokumentbeskrivelse
-Arkivstruktur.Basisregistrering -|> Arkivstruktur.Registrering
-Arkivstruktur.Merknad "+merknad 0..*" <--* Arkivstruktur.Basisregistrering
-Arkivstruktur.Registrering "+registrering 1..*" o--> "+dokumentbeskrivelse 0..*" Arkivstruktur.Dokumentbeskrivelse
-Arkivstruktur.Dokumentbeskrivelse "+dokumentbeskrivelse 1" o-> "+dokumentobjekt 0..*" Arkivstruktur.Dokumentobjekt
-Arkivstruktur.Dokumentobjekt *-> "+konvertering 0..*" Arkivstruktur.Konvertering
-Arkivstruktur.ElektroniskSignatur -[hidden]-> Arkivstruktur.Dokumentobjekt
-@enduml
-</pre>
-
-<p><a href="http://plantuml.com/class-diagram">The format</a> is quite
-compact, with little redundant information. The text expresses
-entities and relations, and there is little layout related fluff. One
-can reuse content by using include files, allowing for consistent
-naming across several diagrams. The include files can be standalone
-PlantUML too. Here is the content of
-<tt>media/uml-class-arkivskaper.iuml<tt>:</p>
-
-<pre>
-@startuml
-class Arkivstruktur.Arkivskaper <Arkivenhet> {
- +arkivskaperID : string
- +arkivskaperNavn : string
- +beskrivelse : string [0..1]
-}
-@enduml
-</pre>
-
-<p>This is what the complete diagram for the PlantUML notation above
-look like:</p>
-
-<p><img width="80%" src="http://people.skolelinux.org/pere/blog/images/2019-03-25-noark5-plantuml-diagrameksempel.png"></p>
-
-<p>A cool feature of PlantUML is that the generated PNG files include
-the entire original source diagram as text. The source (with include
-statements expanded) can be extracted using for example
-<tt>exiftool</tt>. Another cool feature is that parts of the entities
-can be hidden after inclusion. This allow to use include files with
-all attributes listed, even for UML diagrams that should not list any
-attributes.</p>
-
-<p>The diagram also show some of the warts. Some times the layout
-engine place text labels on top of each other, and some times it place
-the class boxes too close to each other, not leaving room for the
-labels on the relationship arrows. The former can be worked around by
-placing extra newlines in the labes (ie "\n"). I did not do it here
-to be able to demonstrate the issue. I have not found a good way
-around the latter, so I normally try to reduce the problem by changing
-from vertical to horizontal links to improve the layout.</p>
-
-<p>All in all, I am quite happy with PlantUML, and very impressed with
-how quickly its lead developer responds to questions. So far I got an
-answer to my questions in a few hours when I send an email. I
-definitely recommend looking at PlantUML if you need to make UML
-diagrams. Note, PlantUML can draw a lot more than class relations.
-Check out the documention for a complete list. :)</p>
-
<p>As usual, if you use Bitcoin and want to show your support of my
activities, please send Bitcoin donations to my address
<b><a href="bitcoin:15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b">15oWEoG9dUPovwmUL9KWAnYRtNJEkP1u1b</a></b>.</p>