refactor: separate the weight loading in the npu layer class. #489
base: main
Conversation
eef8bfc to aaa9676
    int weight_position) {
  for (const auto& [name, tensor] : state_dict) {
    if (absl::EndsWith(name, tensor_name)) {
      at::Tensor mutable_tensor = tensor;
change all at:: to torch::.
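For illustration, a minimal sketch of the rename being asked for; torch::Tensor is an alias of at::Tensor, so only the spelling changes:

    torch::Tensor mutable_tensor = tensor;  // was: at::Tensor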
void BaseLoader::set_weight(const StateDict& state_dict,
                            const std::string& tensor_name,
                            int weight_position,
clarify whether weight_position should be int32_t or int64_t instead of plain int.
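A sketch of the suggested signature, picking int32_t purely for illustration (the project may prefer int64_t); trailing parameters from the original overload are omitted here:

    void set_weight(const StateDict& state_dict,
                    const std::string& tensor_name,
                    int32_t weight_position);  // fixed-width instead of int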
torch::Dtype BaseLoader::string2dtype(const std::string& dtype_str) {
  if (dtype_str.compare("float16") == 0) {
use switch
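C++ cannot switch directly on a std::string, so one common way to realize this suggestion and drop the compare() chain is a static lookup table; the dtype set below is assumed rather than taken from the patch, and <unordered_map> would need to be included:

    torch::Dtype BaseLoader::string2dtype(const std::string& dtype_str) {
      // Sketch only: map dtype names to scalar types in one place.
      static const std::unordered_map<std::string, torch::Dtype> kDtypeMap = {
          {"float16", torch::kFloat16},
          {"bfloat16", torch::kBFloat16},
          {"float32", torch::kFloat32},
      };
      const auto it = kDtypeMap.find(dtype_str);
      CHECK(it != kDtypeMap.end()) << "unsupported dtype: " << dtype_str;
      return it->second;
    }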
  if (tensor.dtype() != torch::kInt8 && tensor.dtype() != torch::kInt32 &&
      tensor.dtype() != torch::kInt64) {
    torch::Dtype dtype = string2dtype(torch_dtype_);
replace torch::Dtype with torch::ScalarType.
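A one-line sketch of the rename; torch::Dtype is itself an alias of the scalar-type enum, so behavior does not change:

    torch::ScalarType dtype = string2dtype(torch_dtype_);  // was: torch::Dtype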
                              const ModelContext& context)
    : BaseLoader(weight_count, context) {
  auto options = context.get_tensor_options();
  dtype_ = c10::typeMetaToScalarType(options.dtype());
replace c10:: with torch::.
namespace layer {
class ColumParallelLinearLoader : public BaseLoader {
 public:
  explicit ColumParallelLinearLoader(uint64_t weight_count,
no need to mark the constructor explicit when it takes two parameters.
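A sketch of the suggested declaration; explicit mainly guards single-argument implicit conversions, so it adds little on a two-parameter constructor:

    ColumParallelLinearLoader(uint64_t weight_count, const ModelContext& context);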
void ColumParallelLinearLoader::verify_loaded_weights(
    const std::string& weight_str) const {
  CHECK(at_weight_tensors_[0].sizes() != std::vector<int64_t>({1}))
nit: CHECK_EQ
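As a sketch of the nit: the comparison-specific CHECK macros print both operands on failure, which makes a bad placeholder weight easier to diagnose. The concrete condition and message below are assumptions, not the patch's logic; for the != comparison in the original, CHECK_NE is the direct counterpart:

    // Sketch only: reject the 1-element placeholder tensor (assumed semantics).
    CHECK_NE(at_weight_tensors_[0].numel(), 1)
        << "weight " << weight_str << " is not loaded";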
#include <torch_npu/csrc/core/npu/NPUCachingAllocator.h>
#include <torch_npu/csrc/core/npu/NPUException.h>

#include <map>
try #include <unordered_map> instead of <map>.
Maybe we can move BaseLayer to the npu dir, or merge BaseLayer and NpuBaseLayer, since no other platform will use BaseLayer.
No description provided.