An important functionality of the package is to validate, standardize or otherwise "clean up" the input data (most often from spreadsheets). For example, identifying recognised column names for contaminants etc. and then optionally modifying/standardizing names or throwing an "unrecognised" error.
I think it would be very useful for an end user to be able to visualize what has happened in this process.
I am imagining a table (e.g. function returns a pandas dataframe that gets rendered in the notebook) that has a form like the following:
| Input name |
Changed to |
Status |
sample |
sample_nr |
Non-standard |
obs_well |
|
Standard |
Depth |
depth |
Non-standard |
CH4 |
methane |
Non-standard |
nonexistylene |
|
Unrecognized |
benzene |
|
Standard |
naphthalene |
|
Standard |
blablabla |
|
Unrecognized |
I think this would help the user see at-a-glance what has been modified or not recognized and should help avoid mistakes or unintended behaviour.
An important functionality of the package is to validate, standardize or otherwise "clean up" the input data (most often from spreadsheets). For example, identifying recognised column names for contaminants etc. and then optionally modifying/standardizing names or throwing an "unrecognised" error.
I think it would be very useful for an end user to be able to visualize what has happened in this process.
I am imagining a table (e.g. function returns a pandas dataframe that gets rendered in the notebook) that has a form like the following:
samplesample_nrNon-standardobs_wellStandardDepthdepthNon-standardCH4methaneNon-standardnonexistyleneUnrecognizedbenzeneStandardnaphthaleneStandardblablablaUnrecognizedI think this would help the user see at-a-glance what has been modified or not recognized and should help avoid mistakes or unintended behaviour.