This website uses cookies. You can find more information about this in our privacy policy.

Options to access

We offer research data in various degrees of anonymization, each of which entails certain sensitivities and protection requirements and is therefore made available via different access paths. In general, the lower the level of anonymization, the greater the effort involved in requesting and using it. You can learn more about the individual variants below (descending degree of anonymization):
not yet available

PUF

Short description Dataset is reduced and anonymized to such an extent that there are no restrictions on use and it is possible to share the data outside of scientific research.
Request and usage Public Use Files can be downloaded from our website without request or reason.
not yet available

CUF

Short description Dataset is less reduced and anonymized than a PUF. It can be used for academic courses, for example.
Request and usage To download a Campus Use File, you must register on our website with an academic institution's email address. Subsequently, a form must be filled out with information on the exact use. After verification of the information, the corresponding download link will be sent by email.

Offsite

Short description Dataset is less reduced and anonymized than a CUF. It may only be used for scientific research purposes.
Request and usage To download an offsite dataset, you must register on our website with the email address of a scientific institution. A form must then be filled out with information about the research activity and the use of the dataset. After a verification of the information, the corresponding download link will be sent by email.

Remote

Short description Dataset is less reduced and anonymized than an offsite dataset. It may only be used for scientific research purposes.
Request and usage To download a remote dataset, registration on our website with the email address of a scientific institution is required. A form must then be filled out with information about the research activity and the use of the dataset. After a verification of the information, a Secure Virtual Desktop (SVD) is set up and an e-mail with access data is sent. DeZIM employees also have the option of simplified access via a Secure Local Repository (SLR).

Attention: Access to the virtual desktop requires the installation of a VPN program on your computer and a smartphone app for two-factor authentication (similar to a TAN procedure in online banking). More detailed information is provided in the registration process.
Details
Within the SVD working environment, a full Linux system with the statistical software R / Rstudio as well as Stata is provided. Additionally, a browser (Firefox), a text editor (gedit) as well as a file manager (Thunar) are installed.

Please note that you do NOT have Internet access within the virtual environment. This is for the security of the input data as well as your result data. Therefore, you do NOT have the possibility to install software on your own. If you need additional software / R or Stata packages, please contact us by e-mail.

R packages: Zelig, acepack, aod, apsrtable, arm, betareg, biglm, bma, boot, boot, bootstrap, brglm, car, caret, catspec, class, cluster, dispmod, dplyr, dr, e1071, ebal, effects, ergm, exactloglintest, exactloglintest, fastica, fixest, flextable, foreign, gam, gcookbook, gee, geepack, ggalluvial, ggplot2, ggrepel, ggridges, ggvis, glmnet, gmodels, gnm, gss, gsynth, hmisc, hrbrthemes, igraph, influence.me, interflex, janitor, knitr, latentnet, leaps, leaps, lme4, lmesplines, lmm, lmoments, lmtest, locfit, logistf, lsmeans, lubridate, maps, mass, matching, matchit, mcmcglmm, mediation, memisc, mgcv, mi, mice, mitools, mix, mlogit, mnp, multcomp, multgee, multinomrob, multiplex, multiplex, network, nlme, nlstools, nnet, norm, np, openxlsx, optmatch, pacman, pafit, pan, plm, plotly, psagraphics, pscl, psych, quantreg, qvcalc, randomforest, remotes, rgl, rmarkdown, rms, rsiena, rstantools, sandwich, simpleboot, sm, sna, spatial, stargazer, statnet, stringr, survey, survival, tidymodels, tidyr, tidyverse, vcd, vgam, vim, viridis, viridislite, visreg, writexl, xtable,

Stata packages: anogi, asdoc, avar, barplot, barplot2, bayesmlogit, bcoeff, bcoeffs, bcuse, binscatter, catplot, cdfplot, cem, center, cluster, clustergram, coefplot, collapse2, combineplot, crtest, decomp, decompose, delta, devcon, devnplot, dfl, diff, distinct, egenmore, estout, expand_n, fairlie, filelist, fitstat, fre, ftools, gllamm, gologit, goprobit, gpfoble, grep, grfreq, grinter, grlogit, grnote, grstyle, gsa, gsample, gsum, gtools, hammock, hausman, hbar, hbox, hist3, histbox, historaj, histplot, hte, ice, igraph, ivreg2, ivreg210, ivreg28, ivreg29, jmpierce, jmpierce2, kdens, kdens2, kdmany, keeporder, keepvar, kernel, khb, kmatch, kountry, ldecomp, linkplot, lmcol, lmtest, logout, logtest, margeff, margfx, margin, marginscontplot2, mdesc, mgof, mice, missing, mmsel, mrtab, mvdcmp, network, networkDynamic, nlcheck, oaxaca, oaxaca9, outreg, outreg2, overid, palettes, psmatch2, ranktest, raschtest, rd, reghdfe, renames, reorder, ritest, robreg, rvlrplot, rvpplot2, sadi, smithwelch, sna, spmap, spost13_ado, sq, sum2, sum2docx, summout, summtab, sumstats, texdoc, tscollap, unique, violin, vioplot, webdoc, wgttest, winsor, winsor2, xttest2, xttest3,
not yet available

Onsite

Short description Dataset is less reduced and anonymized than a remote dataset. It may only be used for scientific research purposes.
Request and usage To use an onsite dataset, registration on our website with the email address of a scientific institution is required. A form must then be filled out with information about the research activity and how the dataset will be used. After a review of the information, contact will be established with the applicant and one or more appointments will be made to work on a dedicated workstation at the DeZIM premises.