Abstract
Graphical abstract
Display Omitted
Highlights
• | A new research task of disease-location recognition based on tongue image is researched, the first large-scale clinical tongue image dataset with the diagnostic label is established. | ||||
• | A novel inner-imaging channel-wise attention mechanism is proposed, it reduces the redundancy of the CNNs model and improves its modeling efficiency. | ||||
• | A dynamic regional pooling mechanism is proposed. Several specific areas are intercepted from one feature map to form multiple concentrated signals and remove edge noise. |
Abstract
Objective
Using the deep learning model to realize tongue image-based disease location recognition and focus on solving two problems: 1. The ability of the general convolution network to model detailed regional tongue features is weak; 2. Ignoring the group relationship between convolution channels, which caused the high redundancy of the model.
Methods
To enhance the convolutional neural networks. In this paper, a stochastic region pooling method is proposed to gain detailed regional features. Also, an inner-imaging channel relationship modeling method is proposed to model multi-region relations on all channels. Moreover, we combine it with the spatial attention mechanism.
Results
The tongue image dataset with the clinical disease-location label is established. Abundant experiments are carried out on it. The experimental results show that the proposed method can effectively model the regional details of tongue image and improve the performance of disease location recognition.
Conclusion
In this paper, we construct the tongue image dataset with disease-location labels to mine the relationship between tongue images and disease locations. A novel fully-channel regional attention network is proposed to model the local detail tongue features and improve the modeling efficiency.
Significance
The applications of deep learning in tongue image disease-location recognition and the proposed innovative models have guiding significance for other assistant diagnostic tasks. The proposed model provides an example of efficient modeling of detailed tongue features, which is of great guiding significance for other auxiliary diagnosis applications.
- [1] , Prudent: a pruned and confident stacking approach for multi-label classification, IEEE Trans Knowl Data Eng 27 (2015) 2480–2493.Google Scholar
- [2] , Comparative effectiveness of convolutional neural network (cnn) and recurrent neural network (rnn) architectures for radiology text report classification, Artif Intell Med 97 (2019) 79–88.Google ScholarDigital Library
- [3] , Classification of glomerular hypercellularity using convolutional features and support vector machine, Artif Intell Med 103 (2020) 101808.Google Scholar
- [4] , Aˆ 2-nets: double attention networks, Neural Inf Process Syst (2018) 352–361.Google Scholar
- [5] , A novel approach based on computerized image analysis for traditional chinese medical diagnosis of the tongue, Comput Methods Programs Biomed 61 (2000) 77–89.Google ScholarCross Ref
- [6] , Xception: deep learning with depthwise separable convolutions, Proceedings of the IEEE conference on computer vision and pattern recognition (2017) 1800–1807.Google Scholar
- [7] , 3d data denoising via nonlocal means filter by using parallel gpu strategies. Computational and mathematical methods in medicine 2014, 2014.Google Scholar
- [8] , Expanding public health in China: an empirical analysis of healthcare inputs and outputs, Public Health 142 (2017) 73–84.Google Scholar
- [9] , Imagenet: a large-scale hierarchical image database, Proceedings of the IEEE conference on computer vision and pattern recognition (2009) 248–255.Google Scholar
- [10] , Classifying cancer pathology reports with hierarchical self-attention networks, Artif Intell Med 101 (2019) 101726.Google ScholarDigital Library
- [11] , Skin lesion classification using cnns with patch-based attention and diagnosis-guided loss weighting, IEEE Trans Biomed Eng (2019).Google Scholar
- [12] , Discriminative methods for multi-labeled classification, Pacific-Asia conference on knowledge discovery and data mining (2004) 22–30.Google Scholar
- [13] , Cnn-based projected gradient descent for consistent ct image reconstruction, IEEE Trans Med Imaging 37 (2018) 1440–1453.Google Scholar
- [14] , Deep residual learning for image recognition, Proceedings of the IEEE conference on computer vision and pattern recognition (2016) 770–778.Google Scholar
- [15] , Identity mappings in deep residual networks, European conference on computer vision (2016) 630–645.Google Scholar
- [16] , Gather-excite: exploiting feature context in convolutional neural networks, Neural Inf Process Syst (2018) 9401–9411.Google Scholar
- [17] , Squeeze-and-excitation networks, Proceedings of the IEEE conference on computer vision and pattern recognition (2018).Google Scholar
- [18] , Automated tongue diagnosis on the smartphone and its applications, Comput Methods Programs Biomed 174 (2017) 51–64.Google Scholar
- [19] , Automatic construction of chinese herbal prescriptions from tongue images using cnns and auxiliary latent therapy topics, IEEE Trans Cybern (2019).Google Scholar
- [20] , Densely connected convolutional networks, Proceedings of the IEEE conference on computer vision and pattern recognition (2017) 2261–2269.Google Scholar
- [21] , Batch normalization: accelerating deep network training by reducing internal covariate shift, International conference on machine learning (2015) 448–456.Google Scholar
- [22] , Medical images fusion by using weighted least squares filter and sparse representation, Comput Electr Eng 67 (2018) 252–266.Google ScholarDigital Library
- [23] , Oscillometric blood pressure estimation by combining nonparametric bootstrap with gaussian mixture model, Comput Biol Med 85 (2017) 112–124.Google Scholar
- [24] , Eac-net: deep nets with enhancing and cropping for facial action unit detection, IEEE Trans Pattern Anal Mach Intell (2018) 1.Google Scholar
- [25] , Multi-resolution convolutional networks for chest x-ray radiograph based lung nodule detection, Artif Intell Med (2019) 101744.Google Scholar
- [26] , Selective kernel networks, Proceedings of the IEEE conference on computer vision and pattern recognition (2019) 510–519.Google Scholar
- [27] , Tooth-marked tongue recognition using multiple instance learning and cnn features, IEEE Trans Cybern 49 (2018) 380–387.Google Scholar
- [28] , Joint magnetic calibration and localization based on expectation maximization for tongue tracking, IEEE Trans Biomed Eng 65 (2018) 52–63.Google Scholar
- [29] , Complexity perception classification method for tongue constitution recognition, Artif Intell Med 96 (2019) 123–133.Google Scholar
- [30] , Pcfnet: deep neural network with predefined convolutional filters, Neurocomputing 382 (2020) 32–39.Google Scholar
- [31] , Fully-automated deep learning-powered system for dce-mri analysis of brain tumors, Artif Intell Med 102 (2020) 101769.Google Scholar
- [32] , Pulmonary artery-vein classification in ct images using deep learning, IEEE Trans Med Imaging 37 (2018) 2428–2440.Google Scholar
- [33] , Attentive systems: a survey, Int J Comput Vis 126 (2018) 86–110.Google Scholar
- [34] , Computerized tongue diagnosis based on bayesian networks, IEEE Trans Biomed Eng 51 (2004) 1803–1810.Google Scholar
- [35] , Multi-planar 3d breast segmentation in mri via deep convolutional neural networks, Artif Intell Med 103 (2020) 101781.Google Scholar
- [36] , 3d-cnn based discrimination of schizophrenia using resting-state fmri, Artif Intell Med 98 (2019) 10–17.Google Scholar
- [37] , A multi-context cnn ensemble for small lesion detection, Artif Intell Med 103 (2020) 101749.Google Scholar
- [38] , A deep neural network-based permanent magnet localization for tongue tracking, IEEE Sens J (2019).Google Scholar
- [39] , Grad-cam: visual explanations from deep networks via gradient-based localization, International conference on computer vision (2017) 618–626.Google Scholar
- [40] , Basic theory of traditional chinese medicine, Chin J Integr Tradit West Med 17 (1997) 643.Google Scholar
- [41] , Very deep convolutional networks for large-scale image recognition, International conference on learning representations (2015).Google Scholar
- [42] , Tooth-marked tongue recognition using gradient-weighted class activation maps, Fut Internet 11 (2019) 45.Google Scholar
- [43] , Rethinking the inception architecture for computer vision, Proceedings of the IEEE conference on computer vision and pattern recognition (2016) 2818–2826.Google Scholar
- [44] , Do chinese hospital services constitute an oligopoly? Evidence of the rich-club phenomenon in a patient referral network, Fut Gener Comput Syst 105 (2020) 492–501.Google Scholar
- [45] , Statistical analysis of tongue images for feature extraction and diagnostics, IEEE Trans Image Process 22 (2013) 5336–5347.Google Scholar
- [46] , A new tongue colorchecker design by space representation for precise correction, IEEE J Biomed Health Inform 17 (2013) 381–391.Google Scholar
- [47] , Reconstruction of high-resolution tongue volumes from mri, IEEE Trans Biomed Eng 59 (2012) 3511–3524.Google Scholar
- [48] , Cbam: convolutional block attention module, European conference on computer vision (2018) 3–19.Google Scholar
- [49] , Aggregated residual transformations for deep neural networks, Proceedings of the IEEE conference on computer vision and pattern recognition (2017) 5987–5995.Google Scholar
- [50] , Knowledge-based collaborative deep learning for benign-malignant lung nodule classification on chest ct, IEEE Trans Med Imaging 38 (2018) 991–1004.Google Scholar
- [51] , Cracked tongue recognition based on deep features and multiple-instance svm, Pacific rim conference on multimedia (2018) 642–652.Google Scholar
- [52] , Detecting diabetes mellitus and nonproliferative diabetic retinopathy using tongue color, texture, and geometry features, IEEE Trans Biomed Eng 61 (2014) 491–501.Google Scholar
- [53] , Introduction to tongue image analysis, Springer Singapore, Singapore, 2017, pp. 3–18. chapter 1.Google Scholar
- [54] , Shufflenet: an extremely efficient convolutional neural network for mobile devices, Proceedings of the IEEE conference on computer vision and pattern recognition (2018) 6848–6856.Google Scholar
- [55] , Learning deep features for discriminative localization, Proceedings of the IEEE conference on computer vision and pattern recognition (2016) 2921–2929.Google Scholar
- [56] , Tonguenet: accurate localization and segmentation for tongue images using deep neural networks, IEEE Access 7 (2019) 148779–148789.Google Scholar
- [57] , A k-plsr-based color correction method for tcm tongue images under different illumination conditions, Neurocomputing 174 (2016) 815–821.Google Scholar
Index Terms
(auto-classified)Fully-channel regional attention network for disease-location recognition with tongue images
Comments