Digital Brown Bag | Conducting bibliometric research with tidyJSTOR
JSTOR Data for Research allows researchers to text mine content in JSTOR’s extensive archive of scholarly literature; journal articles, primary sources, and books; textual content (metadata or OCR-ed full text) can be assembled into a downloadable dataset for analysis with one’s own tools. Arthur Netto, a Ph.D. student at the University of São Paulo and fellow of the Center for the History of Political Economy at Duke University, started using R bibliometric tools to make NGRAM plots of JSTOR data, but found he needed – and then created – more functions for analyzing the content. He wrapped everything into a single R package – tidyJSTOR – and uploaded the package to Github in order to disseminate it and receive comments on code and functions.
At this Digital Brown Bag talk, Arthur will talk about the research questions motivating his use of JSTOR’s Data for Research, the potential and limitations of existing R bibliometric tools, and the additional features he’s built into tidyJSTOR. As a self-taught R user and self-taught user of bibliometrics, he’s also eager to hear others’ approaches and suggestions. If you’d like to check out tidyJSTOR before the talk, please do! You can access the tool and the tutorial here: https://github.com/arthurbnetto/tidyJSTOR.
Digital Brown Bags (hosted by Duke LIbraries' Digital Scholarship & Publishing Services) are informal presentations and talks with Duke faculty or grad students about their digital project work, with the goal of making digital project work more familiar and possible. All are welcome!
Co-Director, ScholarWorks: A Center for Scholarly Publishing at Duke University Libraries; and Head, Digital Scholarship & Publishing Services at Duke University Libraries
Contact me for questions related to planning and managing projects (firstname.lastname@example.org) or drop-by Open Studio (Tuesdays 1:30-3:00 PM, Murthy Digital Studio in Bostock Library).