Skip to content

[Bug]: High Confidence Buckets data has corrupted data #355

@garlin-cant-code

Description

@garlin-cant-code

Is there an existing issue for this?

  • I have searched existing issues

Current Behavior

Some of the data provided in the newly released High Confidence Buckets CSV files appears to be mangled. For example, matching on "0W23H8" returns:

HighConfidenceBuckets_part13.csv:61732: cef55460aee6dd47dad909ed3590ac9453d70d8a2c2231d78d8d1d99153a456d,AMD64,"Micro-Star International Co., Ltd.","Micro-Star International Co., Ltd.","Micro-Star International Co., Ltd.",Pr,Dell Inc. 0W23H8,REV:1.0,SKU,System Product Name,System Version,"American Megatrends International, LLC.",2.5,20/01/2024
HighConfidenceBuckets_part3.csv:66521: 257ee28b7946dbe7fbb0cfab40745a3c09b492029200c2c89e5c5d09d92ad519,AMD64,Dell Inc.,Dell Inc.,Dell Inc.,PowerEdge,0W23H8,A02,SKU=NotProvided;ModelName=PowerEdge R640,PowerEdge R640,,Dell Inc.,2.5.4,01/13/2020
HighConfidenceBuckets_part4.csv:62061: 3f8b7c40dd611d9ca70e9315cdad0ffb05ec1e67f4469cb7b58c746f4ba03c2e,AMD64,Dell Inc.,Dell Inc.,Dell Inc.,PowerEdge,0W23H8,A02,SKU=0716;ModelName=PowerEdge R640,PowerEdge R640,,Dell Inc.,2.21.2,02/19/2024

Obviously MSI doesn't manufacture a Dell product (or at least self-identifies as Dell). Presumably the script that parses the source telemetry data is having problems with UTF8-encoded fields?

Expected Behavior

No errors in the CSV files.

Steps To Reproduce

Searching for selected vendors (ie. Dell) in the CSV files, and filtering by OEMName, OEMModelNumber, and FirmwareVersion.

Build Environment

Any OS.

Version Information

Tag: aa2867d

Urgency

Low

Are you going to fix this?

Someone else needs to fix it

Do you need maintainer feedback?

No maintainer feedback needed

Anything else?

No response

Metadata

Metadata

Assignees

Labels

state:needs-triageNeeds to triaged to determine next stepstype:bugSomething isn't workingurgency:lowLittle to no impact

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions