Complexity matching in dyadic interaction

Abney, D., Paxton, A., Kello, C.T., & Dale, R. (2013). Complexity matching in dyadic interaction. In P. Passos, J. Barreiros, R. Cordovil, D. Araújo, & F. Melo (Eds.), Studies in Perception and Action XII. Proceedings from the Seventeenth International Conference on Perception and Action.

From minor mishap to major catastrophe: Lexical choice in miscommunication

Roche, J. M., Paxton, A., Ibarra, A., & Tanenhaus, M. K. (2013). From minor mishap to major catastrophe: Lexical choice in miscommunication. In M. Knauff, M. Pauen, N. Sebanz, & I. Wachsmuth (Eds.), Proceedings of the 35th Annual Meeting of the Cognitive Science Society. Austin, TX: Cognitive Science Society.


The Belmont Report in the age of big data: Ethics at the intersection of psychological science and data science

Paxton, A. (accepted; anticipated publication date: 2020). The Belmont Report in the age of big data: Ethics at the intersection of psychological science and data science. To appear in S. E. Woo, L. Tay, & R. Proctor (Eds.), Big data methods for psychological research: New horizons and challenges. American Psychological Association.


Media: Helping big data become more ethical and more open

Invited blog post: Psychological scientists can do big data research while preserving our ethical responsibilities to our participants and our commitments to improving the openness and transparency of science. Check out my contribution to the Psychonomic Society's #PSBigData Digital Event, in honor of the new special issue of Behavior Research Methods! (And don't forget to check out Dennis et al.'s (2019) exciting open-access piece on privacy and open science for researchers doing large-scale data collection.)

Publication: Intersectional, Social, Ecological

Conference proceedings: We argue that ecological psychology must grow to consider how intersectionality (Crenshaw, 1991) impacts social effectivities and social affordances (Paxton, Blau, & Weston, forthcoming, Studies in Perception and Action XV: Proceedings from the Twentieth International Conference on Perception and Action). Read our preprint!

Grant: Ford-Sloan

Grant: Along with PI R. Stuart Geiger (University of California, Berkeley) and fellow co-PI Lilly Irani (University of California, San Diego), our team won an interdisciplinary grant from the Ford and Sloan Foundations to improve our understanding of important digital infrastructure. Check out the announcement from Ford!

Methods: ALIGN

Methods development: Automatically and reproducibly quantify multi-level linguistic alignment in natural conversation with ALIGN (Duran, Paxton, & Fusaroli, accepted, Psychological Methods). Find our Python package on GitHub, or install it directly from PyPI. Read about it in our preprint on PsyArXiv!

Outreach: Data on the Mind 2017 summer workshop

Outreach: Videos and code from the Data on the Mind 2017 summer workshop are now available! Check out 11 tutorials dedicated to helping cognitive scientists explore questions about cognition and behavior with big and naturally occurring data. Find out more through the links below.



ALIGN. Python library for extracting quantitative, reproducible metrics of multi-level alignment between two speakers in naturalistic language corpora.


bwlf. Code to help get text data into a form ready for recurrence analysis. Development supported by the NSF under grants BCS-0826825 and BCS-0926670.


crqa-tools. Tutorials and tools to help with RQA and CRQA in R. Currently includes tutorials on continuous and categorical RQA and CRQA, along with tools to help with plotting.

Data on the Mind

Data on the Mind. A community resource dedicated to helping bridge the gaps between big data and cognitive science.

Data on the Mind 2017 workshop

Data on the Mind 2017 workshop. Materials from Data on the Mind's 2017 summer workshop, aimed at providing early-career researchers in cognitive science and psychology with hands-on introductions to essential data science skills. Exercises and tutorial videos are provided below. Generously funded by the Estes Fund, Berkeley's D-Lab, and Project Jupyter.


Gensim-LSI-Word-Similarities. Functions to calculate various similarity scores from Gensim's latent semantic indexing in Python.


living-documents. A tutorial on using Jupyter notebooks and R markdown to create living documents and reproducible reports.


plotting-coupled-data. Code to plot coupled categorical and continuous time series in a single, elegant plot. It was originally created to visualize concurrent speech (categorical) and movement (continuous) signals together.


stats-tools. Bits of code to help clean up and display statistical analyses, largely in R.


xsede-quickstarts. A few quickstart guides to getting started on XSEDE resources (Jetstream and PSC Bridges). In addition to walkthroughs, it includes some basic scripts that you can use as templates to help manage your instances.



