NEWS.md
- Fixed a bug in `install_sre_gitlab()` where it would fail if the desired package didn’t have a previous version installed.
- Added `date_format` and `time_format` to `read_dat()`, `dat_to_arrow_formats()` etc., to allow overriding the global formats when reading in a dataset. Defaults are the same as in `readr` functions.
- Added `health_dict_to_spec()` to create a `readr` column specification from a health dictionary. This allows parsing of most date formats used in the health datasets (#37).
- Added `flag_across`, similar to `filter_across`, but rather than filtering the dataset it adds a logical column indicating the result of the pattern search across the selected columns (#38).
- `msp_unique` is now re-exported in the hawkeye package.
- The `dipr_icd_categories`, `dipr_icd9_categories` and `dipr_icd10_categories` functions are superseded by functionality added to the hawkeye package.
- `filter_across`
- `install_sre_gitlab`. (#24)
- `get_gitlab_sre_repos`
- Added `filter_linked`, and added `Dataset` methods for the `add_linked_status_col` function so that it can be used in an arrow workflow.
- The `DESCRIPTION` file now adds dependent packages to the `Depends` field rather than the `Imports` field.
- Fixed a bug where the `.Rprofile` file used in `dipr_create_targets_project()` was not included in the package.
- `msp_unique` is now re-exported in the hawkeye package.
- Added `restore_rstudio_prefs()` to help set up RStudio on a new SRE machine (#31).
- Added `ocwa_branch_export()` (#26) to create a clean branch to prepare the repo for import into OCWA by:
  - using the `_ocwaignore` file in the root of the repo
  - editing `README.md` to comment out references to images and links that won’t be available in the SRE.
- Added `dipr_create_targets_project`, which will create a thin package-like targets folder structure.
- Added `dipr_document_output_groups` as a convenience to document datasets.
- Updated `get_core_dat_path()` and `get_core_dict_path()` to accommodate the new structure of provisioned data where the metadata are in the same directory as the data. (#17)
- Added `data_format` argument to `dat_to_arrow_formats()` and friends, as well as `read_dat()` and the internal `dipr_reader()` (#17).
- Added `...` in `read_nflt()` to allow passing options to `readr::read_delim()` (#17).
- Comment handling in `read_nflt`. Comments are by default expected to be denoted by `/*`, but this is customizable with the `comment` argument. (#21)
- Removed caching (in `read_dat`) as it was not being used. Removing caching removed significant code complexity.
- Added `set_gitlab_credentials()` to pull up the Git Credential Manager proactively (#6).
- `read_dat` can now directly read two files supplied as a vector of `.dat.gz`. We have added a second dipr example data set to test this behaviour.
- Added `dat_to_datasets` function, a low-memory method to create partitioned Arrow Datasets.
- `age_at_date`
- Added `dipr_icd9_categories` and `dipr_icd10_categories` functions to process ICD codes. Thanks to Craig Hutton for this contribution.
- Changed `msp_unique` to only use the primary ICD-9 code. This has primarily been done to work around memory bottlenecks.
- Added `get_core_dat_path` and `get_core_dict_path` functions.
- Added `dipr_use_export_doc` function.
- Added `read_dat_dt` function, which will return a data.table object.
- Removed `as_data_table` argument from `read_dat`.
- Fixed a bug in `read_dat` where `...` weren’t actually passed to `vroom`, and added a test.
- Added `add_linked_status_col` function and test.
- Added `group_ages()` function.
- `is_emptyish`