Abstract The cloud vertical structure (CVS) is important, yet operational CVS products depend on active observation or reanalysis fields, limiting high‐frequency monitoring. In this study, we propose a lightweight and satellite‐only model that reconstructs volumetric cloud masks from geostationary multispectral imagery. This method employs a compact one‐dimensional convolutional neural network that combines three convolutional layers, channel attention and L1 regularization, which is trained on CALIPSO/CloudSat joint profiles and Himawari‐8 multispectral observations. The network produces per‐pixel 38‐layer cloud masks at 500 m vertical resolution and attains strong performance (Intersection over Union = 0.8730; mean absolute error of cloud thicknes = 0.4651 km; cloud top height bias ≈453.25 m). Ablation experiments demonstrate that the chosen architecture and regularization considerably improve layer discrimination. A case study of Typhoon Yutu shows that the reconstructed three‐dimensional structure is consistent with active‐sensor profiles. This observation‐only retrieval reconstructs CVS independent of meteorological inputs, avoiding potential double‐use of geostationary data.
Yang et al. (Tue,) studied this question.