Terminology:
Data is information gathered through observation that has not yet been analyzed. Files containing this raw data are called datasets, and each dataset is typically devoted to a single body of work. Researchers analyze these files by loading them into software such as Microsoft Excel, SAS, or IBM SPSS Statistics.
Data is also the raw material from which statistics are created through analysis and interpretation. Statistics take the form of numbers and percentages and are often published as a public good by government bodies, such as the United States Census Bureau, and by international institutions, including the World Health Organization.
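To make the distinction between data and statistics concrete, here is a minimal sketch in Python. The survey rows, the column names, and the summarize function are all hypothetical and invented for illustration; they do not come from any real dataset or from the tools named above.

```python
import csv
import io
from collections import Counter

# Hypothetical raw data: one row per survey respondent (invented for illustration).
RAW_CSV = """respondent_id,age,owns_library_card
1,34,yes
2,27,no
3,45,yes
4,52,yes
"""

def summarize(csv_text):
    """Turn raw rows (data) into summary numbers and percentages (statistics)."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    ages = [int(row["age"]) for row in rows]
    card_counts = Counter(row["owns_library_card"] for row in rows)
    return {
        "respondents": len(rows),
        "mean_age": sum(ages) / len(ages),
        "percent_with_card": 100 * card_counts["yes"] / len(rows),
    }

stats = summarize(RAW_CSV)
print(stats)
```

The rows in RAW_CSV are the data; the count, mean, and percentage returned by summarize are statistics derived from it. A real dataset would normally be read from a file rather than an inline string.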
Research data is often housed for future sharing and preservation in dedicated storage spaces called data repositories. A reputable data repository is easily accessible and searchable, and it ensures the long-term preservation of its datasets. Several data repositories are available through the University of Texas at Dallas.