Improve speed for known BLS datasets #31

@schmidtDETR

fread_bls() currently carries a lot of overhead that slows the processing of large datasets. The overhead exists to handle occasional problems in BLS files (e.g. an intermittently populated column that is missing its column name in the middle of the file, which causes errors in the subsequent column names). It is now the largest delay in functions like get_oews(), because cleaning a large file takes noticeable time, even when the file is cached and the series file is not used in mapping.

The proposed enhancement adds an optional argument that invokes a streamlined fread_bls(), which reads the table without the extra cleaning and verification steps. Those steps should remain, particularly for files of unknown structure, where they add more value for data exploration (e.g. load_bls_dataset()).
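
A rough sketch of the idea, in base R so it runs without extra packages. Everything named here is an assumption for illustration: `read_bls_sketch()` is a stand-in, not the real fread_bls(), the `fast` argument name is hypothetical, and the "careful" branch is a much simplified substitute for the existing cleaning (it only names columns the header does not cover and pads short rows).

```r
# Hypothetical stand-in for fread_bls(); `fast` is an assumed argument name.
read_bls_sketch <- function(path, fast = FALSE) {
  if (fast) {
    # Fast path: trust the header and parse the tab-delimited table directly.
    return(utils::read.delim(path, strip.white = TRUE,
                             stringsAsFactors = FALSE, check.names = FALSE))
  }

  # Careful path (simplified): read raw lines and check the header width
  # against the widest data row before building the table.
  lines  <- readLines(path, warn = FALSE)
  lines  <- lines[nzchar(lines)]
  fields <- strsplit(lines, "\t", fixed = TRUE)
  header <- trimws(fields[[1]])
  rows   <- fields[-1]
  width  <- max(length(header), lengths(rows))

  # Give placeholder names to any columns the header does not cover.
  if (width > length(header)) {
    header <- c(header, paste0("unnamed_", seq_len(width - length(header))))
  }

  # Pad short rows with NA so every row has the same number of fields.
  rows <- lapply(rows, function(r) {
    length(r) <- width
    trimws(r)
  })

  out <- as.data.frame(do.call(rbind, rows), stringsAsFactors = FALSE)
  names(out) <- header
  # Convert character columns to integer/numeric where possible.
  out[] <- lapply(out, utils::type.convert, as.is = TRUE)
  out
}

# Usage on a small, well-formed file: both paths see the same columns.
tmp <- tempfile(fileext = ".txt")
writeLines(c("series_id\tyear\tvalue",
             "A1\t2023\t10.5",
             "A2\t2024\t11.0"), tmp)
fast_tbl <- read_bls_sketch(tmp, fast = TRUE)
full_tbl <- read_bls_sketch(tmp)
```

Defaulting `fast` to FALSE keeps the current behaviour for exploratory callers such as load_bls_dataset(), while functions that read files of known structure, such as get_oews(), could opt in.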

Labels: enhancement