Question about airlines dataset #2

@bigwater

Hi,

I am trying to test the airlines application in your repository, but load_data.py fails with an error.

load_data.py gets the dataset from deephyper.benchmark.datasets.airlines, and that part works. However, some columns in the dataset are strings, for example the airline and airport names.

After loading the dataset from deephyper.benchmark.datasets.airlines, the call prepro_input.fit_transform(X_train) fails with ValueError: could not convert string to float: 'OO'. (The full traceback is at the bottom.)

Do you have any suggestions? Or where can I get the correct version of the dataset?

Thank you so much...

!!! USING TEST DATA !!!
Uncaught exception <class 'ValueError'>: could not convert string to float: 'OO'
Traceback (most recent call last):
  File "load_data.py", line 91, in <module>
    load_data(use_test=True)
  File "load_data.py", line 48, in load_data
    return load_data_cache(use_test=use_test)
  File "/lus/theta-fs0/projects/VeloC/hyliu/work_deephyper/deephyper/deephyper/benchmark/datasets/util.py", line 30, in wrapper
    (X_train, y_train), (X_valid, y_valid) = data_loader(*args, **kwargs)
  File "load_data.py", line 37, in load_data_cache
    X_train = prepro_input.fit_transform(X_train)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/pipeline.py", line 378, in fit_transform
    Xt = self._fit(X, y, **fit_params_steps)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/pipeline.py", line 307, in _fit
    **fit_params_steps[name])
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/joblib/memory.py", line 352, in __call__
    return self.func(*args, **kwargs)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/pipeline.py", line 754, in _fit_transform_one
    res = transformer.fit_transform(X, y, **fit_params)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/base.py", line 699, in fit_transform
    return self.fit(X, **fit_params).transform(X)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/preprocessing/_data.py", line 363, in fit
    return self.partial_fit(X, y)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/preprocessing/_data.py", line 398, in partial_fit
    force_all_finite="allow-nan")
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/base.py", line 421, in _validate_data
    X = check_array(X, **check_params)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/utils/validation.py", line 63, in inner_f
    return f(*args, **kwargs)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/utils/validation.py", line 616, in check_array
    array = np.asarray(array, order=order, dtype=dtype)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/numpy/core/_asarray.py", line 85, in asarray
    return array(a, dtype, copy=False, order=order)
ValueError: could not convert string to float: 'OO'
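
The traceback ends inside a scikit-learn scaler's fit, so the failure looks like a numeric scaler being applied to a string column. Below is a minimal sketch of that situation and one possible workaround. It is not the repository's actual preprocessing: the toy DataFrame, its column names, the choice of MinMaxScaler, and the use of OrdinalEncoder are all assumptions made for illustration.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler, OrdinalEncoder

# Toy stand-in for the airlines data; column names and values are hypothetical.
X_train = pd.DataFrame(
    {
        "Airline": ["OO", "AA", "DL", "OO"],
        "Distance": [300.0, 1200.0, 800.0, 450.0],
    }
)

# Scaling a frame that still contains strings raises a ValueError,
# because the scaler first converts its input to a float array.
try:
    MinMaxScaler().fit_transform(X_train)
    error_message = None
except ValueError as exc:
    error_message = str(exc)

# Possible workaround (an assumption, not the author's method):
# encode the non-numeric columns as integer codes, then scale everything.
string_cols = [
    col for col in X_train.columns
    if not pd.api.types.is_numeric_dtype(X_train[col])
]
prepro_input = Pipeline(
    [
        (
            "encode",
            ColumnTransformer(
                [("cat", OrdinalEncoder(), string_cols)],
                remainder="passthrough",  # keep numeric columns unchanged
            ),
        ),
        ("scale", MinMaxScaler()),
    ]
)
X_scaled = prepro_input.fit_transform(X_train)
print(error_message)
print(X_scaled)
```

Ordinal codes impose an arbitrary order on the airline names, so a one-hot encoding may be a better fit depending on the model. Whether the benchmark expects the string columns to be encoded or dropped is something only the maintainers can confirm.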
