Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1,083 changes: 1,083 additions & 0 deletions 2025 国赛/C题/metax-deepseek-r1/cleaned_data.csv

Large diffs are not rendered by default.

Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
56 changes: 56 additions & 0 deletions 2025 国赛/C题/metax-deepseek-r1/final_diagnostic_report.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,56 @@
最终诊断报告
==================================================
数据文件: 附件.xlsx
原始样本数: 1082
有效样本数: 100

前5行数据:
序号 孕妇代码 年龄 身高 体重 末次月经 IVF妊娠 检测日期 检测抽血次数 检测孕周 孕妇BMI 原始读段数 在参考基因组上比对的比例 重复读段的比例 唯一比对的读段数 GC含量 13号染色体的Z值 18号染色体的Z值 21号染色体的Z值 X染色体的Z值 Y染色体的Z值 Y染色体浓度 X染色体浓度 13号染色体的GC含量 18号染色体的GC含量 21号染色体的GC含量 被过滤掉读段数的比例 染色体的非整倍体 怀孕次数 生产次数 胎儿是否健康 检测孕周_numeric 孕妇BMI_numeric Y染色体浓度_numeric
0 1 A001 31 160.0 72.0 2023-02-01 00:00:00 自然受孕 20230429 1 11w+6 28.125000 5040534 0.806726 0.027603 3845411 0.399262 0.782097 -2.321212 -1.026003 -0.062103 -1.035610 0.025936 0.038061 0.377069 0.389803 0.399399 0.027484 NaN 1 0 是 NaN 28.125000 0.025936
1 2 A001 31 160.0 73.0 2023-02-01 00:00:00 自然受孕 20230531 2 15w+6 28.515625 3198810 0.806393 0.028271 2457402 0.393299 0.692856 1.168521 -2.595099 0.582183 -0.363519 0.034887 0.059572 0.371542 0.384771 0.391706 0.019617 NaN 1 0 是 NaN 28.515625 0.034887
2 3 A001 31 160.0 73.0 2023-02-01 00:00:00 自然受孕 20230625 3 20w+1 28.515625 3848846 0.803858 0.032596 2926292 0.399890 -0.888702 -1.018236 -1.308662 -0.342564 -0.734503 0.066171 0.075995 0.377449 0.390582 0.399480 0.022312 NaN 1 0 是 NaN 28.515625 0.066171
3 4 A001 31 160.0 74.0 2023-02-01 00:00:00 自然受孕 20230716 4 22w+6 28.906250 5960269 0.802535 0.034762 4509561 0.397977 0.498031 0.770401 -1.476955 1.141242 0.476200 0.061192 0.052305 0.375613 0.389251 0.397212 0.023280 NaN 1 0 是 NaN 28.906250 0.061192
4 5 A002 32 149.0 74.0 2023-11-09 00:00:00 自然受孕 20240219 1 13w+6 33.331832 4154302 0.805008 0.028855 3169114 0.403060 -2.268039 -1.004015 0.863198 -0.441235 -0.889422 0.059230 0.059708 0.380260 0.393618 0.404868 0.024212 NaN 2 1 否 NaN 33.331832 0.059230

列统计:
- 序号: int64, 缺失值: 0/1082
- 孕妇代码: object, 缺失值: 0/1082
- 年龄: int64, 缺失值: 0/1082
- 身高: float64, 缺失值: 0/1082
- 体重: float64, 缺失值: 0/1082
- 末次月经: object, 缺失值: 12/1082
- IVF妊娠: object, 缺失值: 0/1082
- 检测日期: object, 缺失值: 0/1082
- 检测抽血次数: int64, 缺失值: 0/1082
- 检测孕周: object, 缺失值: 0/1082
- 孕妇BMI: float64, 缺失值: 0/1082
- 原始读段数: int64, 缺失值: 0/1082
- 在参考基因组上比对的比例: float64, 缺失值: 0/1082
- 重复读段的比例: float64, 缺失值: 0/1082
- 唯一比对的读段数 : int64, 缺失值: 0/1082
- GC含量: float64, 缺失值: 0/1082
- 13号染色体的Z值: float64, 缺失值: 0/1082
- 18号染色体的Z值: float64, 缺失值: 0/1082
- 21号染色体的Z值: float64, 缺失值: 0/1082
- X染色体的Z值: float64, 缺失值: 0/1082
- Y染色体的Z值: float64, 缺失值: 0/1082
- Y染色体浓度: float64, 缺失值: 0/1082
- X染色体浓度: float64, 缺失值: 0/1082
- 13号染色体的GC含量: float64, 缺失值: 0/1082
- 18号染色体的GC含量: float64, 缺失值: 0/1082
- 21号染色体的GC含量: float64, 缺失值: 0/1082
- 被过滤掉读段数的比例: float64, 缺失值: 0/1082
- 染色体的非整倍体: object, 缺失值: 956/1082
- 怀孕次数: object, 缺失值: 0/1082
- 生产次数: int64, 缺失值: 0/1082
- 胎儿是否健康: object, 缺失值: 0/1082
- 检测孕周_numeric: float64, 缺失值: 1082/1082
- 孕妇BMI_numeric: float64, 缺失值: 0/1082
- Y染色体浓度_numeric: float64, 缺失值: 0/1082

建模结果:
R-squared = 0.0765
孕周系数 p值 = 0.0056
BMI系数 p值 = 0.9701

⚠️ 警告: 使用了演示数据而非真实数据
34 changes: 34 additions & 0 deletions 2025 国赛/C题/metax-deepseek-r1/group_analysis_results.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,34 @@
BMI分组分析结果:
孕周 BMI Y染色体浓度 年龄 BMI分组
223 20.0 40.404040 0.059986 26 >40
224 24.0 40.404040 0.107418 25 >40
226 16.0 43.510937 0.077037 27 >40
227 20.0 43.906491 0.117990 27 >40
228 24.0 44.697599 0.215065 27 >40
323 12.0 46.875000 0.013157 30 >40
324 15.0 46.875000 0.022604 30 >40
405 23.0 40.648877 0.063422 27 >40
419 13.0 41.132812 0.033910 31 >40
420 15.0 41.523438 0.020269 31 >40
421 19.0 42.382812 0.051052 31 >40
422 23.0 42.968750 0.065884 31 >40
473 12.0 40.138408 0.051353 32 >40
474 15.0 40.484429 0.047096 32 >40
475 19.0 40.830450 0.027698 32 >40
476 23.0 40.830450 0.126525 32 >40
495 22.0 44.982699 0.052179 32 >40
669 23.0 45.714286 0.040248 34 >40

逻辑回归参数:
const 0.000000
BMI -0.377692
年龄 -0.036091
dtype: float64

按BMI分组的平均达标概率:
BMI分组 平均达标概率
0 20-28 3.598196e-05
1 28-32 4.219598e-06
2 32-36 1.153090e-06
3 36-40 2.843785e-07
4 >40 4.608776e-08
Loading