The lack of well-labeled training image data has significantly impeded the development of novel DL methods in mineral thin section images identification. However, image annotation, especially pixel-wise annotation is always a costly process. Manually creating dense semantic labels for rock thin section images has been long considered as an unprecedented challenge in view of the ubiquitous variety and complexity of minerals in thin sections. To speed up the annotation, we propose a human–computer collaborative pipeline in which superpixel segmentation is used as a boundary extractor to avoid hand delineation of instances boundaries. Thin section data sets also pose specific requirements to superpixel segmentation algorithms. The most obvious aspect is that thin section data sets contain more than just a single image, due to the combined use of plane-polarized light views and cross-polarized views at different angles. Due to this limitation, we implemented the extension of an existing algorithm, SLIC, to use multiple image layers, resulting in the adapted algorithm MultiSLIC. We evaluated several algorithms with respect to their use for the specific requirements of thin section data sets, Both qualitative and quantitative evaluation studies show an overall good performance of MultiSLIC in terms of boundary adherence and compactness of the resulting segmentation. This is an important aspect for the subsequent labeling by human annotators with a specifically designed labeling tool. We also provide a prototype of superpixel labeling tool in python.