NEWS.md
- Fixed a bug in `install_sre_gitlab()` where it would fail if the desired package didn't have a previous version installed.
- Added `date_format` and `time_format` arguments to `read_dat()`, `dat_to_arrow_formats()` etc., to allow overriding the global formats when reading in a dataset. Defaults are the same as in readr functions.
- Added `health_dict_to_spec()` to create a readr column specification from a health dictionary. This allows parsing of most date formats used in the health datasets (#37)
- Added `flag_across`, similar to `filter_across`, but rather than filtering the dataset it adds a logical column indicating the result of the pattern search across the selected columns (#38)
- `msp_unique` and re-export it in the hawkeye package
- `dipr_icd_categories`, `dipr_icd9_categories` and `dipr_icd10_categories` functions in favour of functionality added to the hawkeye package
- `filter_across`
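As a rough illustration of what a readr column specification with an explicit date format looks like, the sketch below uses plain readr only. The column names and inline data are hypothetical, and it does not show the actual output of `health_dict_to_spec()` or the internals of `read_dat()`; it assumes readr 2.0 or later for `I()` literal input.

```r
library(readr)

# Hypothetical columns; in practice the specification would be
# derived from a health dictionary rather than written by hand.
spec <- cols(
  id = col_character(),
  service_date = col_date(format = "%Y%m%d")
)

# Inline literal data stands in for a real dataset.
dat <- read_csv(
  I("id,service_date\nA1,20200131\nB2,20191201\n"),
  col_types = spec
)

# service_date is parsed as a Date column using the explicit format.
dat$service_date
```

Supplying the format per column, as here, is one way to handle dates that differ from the global default format.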
- `install_sre_gitlab`. (#24)
- `get_gitlab_sre_repos`
- `filter_linked` and add Dataset methods for the `add_linked_status_col` function so that it can be used in an Arrow workflow.
- `DESCRIPTION` file now adds dependent packages to the `Depends` field rather than the `Imports` field.
- `.Rprofile` file used in `dipr_create_targets_project()` was not included in the package.
- `msp_unique` and re-export it in the hawkeye package
- Added `restore_rstudio_prefs()` to help set up RStudio on a new SRE machine (#31)
- Added `ocwa_branch_export()` (#26) to create a clean branch to prepare the repo for import into OCWA by:
  - `_ocwaignore` file in the root of the repo
  - `README.md` to comment out references to images and links that won't be available in the SRE.
- Added `dipr_create_targets_project`, which will create a thin package-like targets folder structure.
- Added `dipr_document_output_groups` as a convenience to document datasets
- `get_core_dat_path()` and `get_core_dict_path()` to accommodate the new structure of provisioned data where the metadata are in the same directory as the data. (#17)
- Added `data_format` argument to `dat_to_arrow_formats()` and friends, as well as `read_dat()` and the internal `dipr_reader()` (#17)
- `...` in `read_nflt()` to allow passing options to `readr::read_delim()` (#17)
- `read_nflt`. Comments are by default expected to be denoted by `/*`, but this is customizable with the `comment` argument. (#21)
- Removed caching (in `read_dat`) as it was not being used. Removing caching removed significant code complexity
- Added `set_gitlab_credentials()` to pull up the Git Credential Manager proactively (#6)
- `read_dat` can now directly read two files supplied as a vector of `.dat.gz`. We have added a second dipr example data set to test this behaviour.
- Added `dat_to_datasets` function, a low-memory method to create partitioned Arrow Datasets.
- `age_at_date`
- Added `dipr_icd9_categories` and `dipr_icd10_categories` functions to process ICD codes. Thanks to Craig Hutton for this contribution.
- `msp_unique` to only use the primary ICD-9 code. This has primarily been done to work around memory bottlenecks.
- `get_core_dat_path` and `get_core_dict_path` functions.
- `dipr_use_export_doc` function
- `read_dat_dt` function which will return a `data.table` object.
- Removed `as_data_table` argument from `read_dat`
- Fixed a bug in `read_dat` where `...` weren't actually passed to vroom and test
- `add_linked_status_col` function and test
- `group_ages()` function
- `is_emptyish`