UTS Library

Find and Reuse Data

Find and use datasets to build on your own research. Reusing has many benefits including the benefit of reducing the cost of duplicating data collection, leading to new collaborations between data users and data creators and increasing the impact and visibility of research (ANDS – Data reuse).

How do I find datasets?

There are many places where you can find and source research data to re-use in your own research, including:

Research Data Australia

Provides descriptions of datasets, other relevant metadata and links to where data can be accessed, making it a good place to look for datasets to re-use. In RDA you can search and filter records by subject, institution and accessibility, and you can find links to related datasets and grant information.

data.gov.au

A government-run repository that provides access to public datasets from the Australian Government and encourages public access and re-use of public data

Open Access Directory

A repository directory

Re3.org (Registry of Research Data Archives)

A repository directory

APIs

APIs (application programming interfaces) are the engines behind making data usable. APIs are important for research data because they are used by developers to make data more discoverable and re-usable.

Many scholarly publishers, databases, and products offer APIs to allow users with programming skills to more powerfully extract data to serve a variety of research purposes.  With an API, users might create programmatic searches of a citation database, extract statistical data, or dynamically query and post blog content.
Examples of APIs (courtesy of MIT Libraries)

Text and Data Mining of Publications

Text and Data Mining (TDM) is the process of automatically extracting large amounts of usable data with the objective of finding patterns or correlations to achieve new insights.

How do I cite data?

If you use data created or collected by other researchers you should cite this data as you would any other publication. When you cite data, it gives attribution to the original creators and allows the impact of these research outputs to be tracked in a similar manner to traditional outputs.

Harvard UTS provides a format for citing data:

Meachen-Samuels, J. & Van Valkenburg, B. 2009, 'Data from: Craniodental indicators of prey size preference in the Felidae', Dryad Digital Repository, electronic data set, viewed 18 April 2015, <http://datadryad.org/resource/doi:10.5061/dryad.6h722>.

If you are publishing data, you should ensure that you publish with a Digital Object Identifier (DOI) to assist others in citing your data. You can request a DOI from UTS eResearch through the Stash research data catalogue.