Movie scripts dataset

I uploaded on figshare (here) a dataset. From the description there:

This dataset contains 1,093 movie scripts collected from the website imsdb.com, each in a separate text file. The file imsdb_sample.txt contains the titles of all movies (corresponding file names are in the form Script_TITLE.txt).

The website was crawled in January 2017. Some scripts are not present as they were missing in imsdb.com or because they were uploaded as pdf files. Please notice that (i) the original scripts were uploaded on the website by individual users, so that they might not correspond exactly to the movie scripts and typos may be present; (ii) html formatting was not consistent in the website, and so neither is the formatting of the resulting text files.

Even considering (i) and (ii), the quality seems good on average and the dataset can be easily used for text-mining tasks.

Continue reading “Movie scripts dataset”

My talk at Cultural Evolution Seminar series in Tartu – Estonia

I gave yesterday a talk, via Skype, in the Cultural Evolution Seminar series at Tartu, Estonia. Oleg Sobchuk and the other organisers are doing a great job, I think, to diffuse knowledge about cultural evolution (and cognitive sciences, and digital humanities, etc.) and I was pleased to give my small contribution. Their website links also to the videos of two of the previous speakers, Cristina Moya and Alex Mesoudi, and provides excellent reading materials and information about cultural evolution.

Continue reading “My talk at Cultural Evolution Seminar series in Tartu – Estonia”

Possible confounds in conformity research – II

[The first part is here]

In a successive series of models, published in Scientific Reports, we considered whether other individual-level mechanisms could potentially be mistaken for conformity, generating relations between frequency of a trait and probability to copy it that looked like sigmoids. We choose a few simple and plausible mechanisms (you can refer to the paper for details) and we found that two of them – on a total of seven tested, plus three controls – generated relations for which a sigmoid function produced a better fit than a linear one (see figure below). The codes for running all simulations (written in Matlab) are available through the Open Science Framework.

Continue reading “Possible confounds in conformity research – II”

Possible confounds in conformity research – I

I recently did some modelling work, in collaboration with Edwin van Leeuwen and others, exploring possible confounds in conformity research. As I discussed in a post some time ago, “conformity”, in cultural evolution, has a precise meaning as a disproportionate tendency to copy the majority. “Disproportionate” here means that the probability to copy a popular cultural trait should be higher than the frequency of the trait itself. In other words, if 60% of your friends wear read, and 40% wear blue, not only you should be more likely to also wear read (this would happen also by copying at random), but your probability to wear read should be higher than 60%. Why is this important? Conformity, in this technical sense, allows majority behaviours to be resistant to random fluctuations, or to changes in population, like migrations, etc. This, in turn, contributes to maintain stable cultural differences between groups.

Continue reading “Possible confounds in conformity research – I”

Some books on digital media

As a part of my “Cultural Evolution in the Digital Age” exploration (see some previous posts, and here a preprint), I’ve recently read some non-academic books about the topic. This is not intended as a review and clearly not as an exhaustive list, but I decided to make a quick blog post as it may be of some interest. Also, I’d be certainly happy to receive other reading suggestions in the comments.

Continue reading “Some books on digital media”