research-article

Fully-channel regional attention network for disease-location recognition with tongue images

Authors Info & Claims
Published:01 August 2021Publication History
Skip Abstract Section

Abstract

Graphical abstract

Display Omitted

Highlights

A new research task of disease-location recognition based on tongue image is researched, the first large-scale clinical tongue image dataset with the diagnostic label is established.

A novel inner-imaging channel-wise attention mechanism is proposed, it reduces the redundancy of the CNNs model and improves its modeling efficiency.

A dynamic regional pooling mechanism is proposed. Several specific areas are intercepted from one feature map to form multiple concentrated signals and remove edge noise.

Abstract

Skip Objective Section

Objective

Using the deep learning model to realize tongue image-based disease location recognition and focus on solving two problems: 1. The ability of the general convolution network to model detailed regional tongue features is weak; 2. Ignoring the group relationship between convolution channels, which caused the high redundancy of the model.

Skip Methods Section

Methods

To enhance the convolutional neural networks. In this paper, a stochastic region pooling method is proposed to gain detailed regional features. Also, an inner-imaging channel relationship modeling method is proposed to model multi-region relations on all channels. Moreover, we combine it with the spatial attention mechanism.

Skip Results Section

Results

The tongue image dataset with the clinical disease-location label is established. Abundant experiments are carried out on it. The experimental results show that the proposed method can effectively model the regional details of tongue image and improve the performance of disease location recognition.

Skip Conclusion Section

Conclusion

In this paper, we construct the tongue image dataset with disease-location labels to mine the relationship between tongue images and disease locations. A novel fully-channel regional attention network is proposed to model the local detail tongue features and improve the modeling efficiency.

Skip Significance Section

Significance

The applications of deep learning in tongue image disease-location recognition and the proposed innovative models have guiding significance for other assistant diagnostic tasks. The proposed model provides an example of efficient modeling of detailed tongue features, which is of great guiding significance for other auxiliary diagnosis applications.

References

  1. [1] Alali A., Kubat M., Prudent: a pruned and confident stacking approach for multi-label classification, IEEE Trans Knowl Data Eng 27 (2015) 24802493.Google ScholarGoogle Scholar
  2. [2] Banerjee I., Ling Y., Chen M.C., Hasan S.A., Langlotz C.P., Moradzadeh N., et al., Comparative effectiveness of convolutional neural network (cnn) and recurrent neural network (rnn) architectures for radiology text report classification, Artif Intell Med 97 (2019) 7988.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. [3] Chagas P., Souza L., Araújo I., Aldeman N., Duarte A., Angelo M., et al., Classification of glomerular hypercellularity using convolutional features and support vector machine, Artif Intell Med 103 (2020) 101808.Google ScholarGoogle Scholar
  4. [4] Chen Y., Kalantidis Y., Li J., Yan S., Feng J., Aˆ 2-nets: double attention networks, Neural Inf Process Syst (2018) 352361.Google ScholarGoogle Scholar
  5. [5] Chiu C.C., A novel approach based on computerized image analysis for traditional chinese medical diagnosis of the tongue, Comput Methods Programs Biomed 61 (2000) 7789.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Chollet F., Xception: deep learning with depthwise separable convolutions, Proceedings of the IEEE conference on computer vision and pattern recognition (2017) 18001807.Google ScholarGoogle Scholar
  7. [7] Cuomo S., De Michele P., Piccialli F., 3d data denoising via nonlocal means filter by using parallel gpu strategies. Computational and mathematical methods in medicine 2014, 2014.Google ScholarGoogle Scholar
  8. [8] Deng F., Lv J., Wang H., Gao J., Zhou Z., Expanding public health in China: an empirical analysis of healthcare inputs and outputs, Public Health 142 (2017) 7384.Google ScholarGoogle Scholar
  9. [9] Deng J., Dong W., Socher R., Li L., Li K., Feifei L., Imagenet: a large-scale hierarchical image database, Proceedings of the IEEE conference on computer vision and pattern recognition (2009) 248255.Google ScholarGoogle Scholar
  10. [10] Gao S., Qiu J.X., Alawad M., Hinkle J.D., Schaefferkoetter N., Yoon H.J., et al., Classifying cancer pathology reports with hierarchical self-attention networks, Artif Intell Med 101 (2019) 101726.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Gessert N., Sentker T., Madesta F., Schmitz R., Kniep H., Baltruschat I., et al., Skin lesion classification using cnns with patch-based attention and diagnosis-guided loss weighting, IEEE Trans Biomed Eng (2019).Google ScholarGoogle Scholar
  12. [12] Godbole S., Sarawagi S., Discriminative methods for multi-labeled classification, Pacific-Asia conference on knowledge discovery and data mining (2004) 2230.Google ScholarGoogle Scholar
  13. [13] Gupta H., Jin K.H., Nguyen H.Q., Mccann M.T., Unser M., Cnn-based projected gradient descent for consistent ct image reconstruction, IEEE Trans Med Imaging 37 (2018) 14401453.Google ScholarGoogle Scholar
  14. [14] He K., Zhang X., Ren S., Sun J., Deep residual learning for image recognition, Proceedings of the IEEE conference on computer vision and pattern recognition (2016) 770778.Google ScholarGoogle Scholar
  15. [15] He K., Zhang X., Ren S., Sun J., Identity mappings in deep residual networks, European conference on computer vision (2016) 630645.Google ScholarGoogle Scholar
  16. [16] Hu J., Shen L., Albanie S., Sun G., Vedaldi A., Gather-excite: exploiting feature context in convolutional neural networks, Neural Inf Process Syst (2018) 94019411.Google ScholarGoogle Scholar
  17. [17] Hu J., Shen L., Sun G., Squeeze-and-excitation networks, Proceedings of the IEEE conference on computer vision and pattern recognition (2018).Google ScholarGoogle Scholar
  18. [18] Hu M.C., Lan K.C., Fang W.C., Huang Y.C., Ho T.J., Lin C.P., et al., Automated tongue diagnosis on the smartphone and its applications, Comput Methods Programs Biomed 174 (2017) 5164.Google ScholarGoogle Scholar
  19. [19] Hu Y., Wen G., Liao H., Wang C., Dai D., Yu Z., Automatic construction of chinese herbal prescriptions from tongue images using cnns and auxiliary latent therapy topics, IEEE Trans Cybern (2019).Google ScholarGoogle Scholar
  20. [20] Huang G., Liu Z., Der Maaten L.V., Weinberger K.Q., Densely connected convolutional networks, Proceedings of the IEEE conference on computer vision and pattern recognition (2017) 22612269.Google ScholarGoogle Scholar
  21. [21] Ioffe S., Szegedy C., Batch normalization: accelerating deep network training by reducing internal covariate shift, International conference on machine learning (2015) 448456.Google ScholarGoogle Scholar
  22. [22] Jiang W., Yang X., Wu W., Liu K., Ahmad A., Sangaiah A.K., et al., Medical images fusion by using weighted least squares filter and sparse representation, Comput Electr Eng 67 (2018) 252266.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Lee S., Rajan S., Jeon G., Chang J.H., Dajani H.R., Groza V.Z., Oscillometric blood pressure estimation by combining nonparametric bootstrap with gaussian mixture model, Comput Biol Med 85 (2017) 112124.Google ScholarGoogle Scholar
  24. [24] Li W., Abtahi F., Zhu Z., Yin L., Eac-net: deep nets with enhancing and cropping for facial action unit detection, IEEE Trans Pattern Anal Mach Intell (2018) 1.Google ScholarGoogle Scholar
  25. [25] Li X., Shen L., Xie X., Huang S., Xie Z., Hong X., et al., Multi-resolution convolutional networks for chest x-ray radiograph based lung nodule detection, Artif Intell Med (2019) 101744.Google ScholarGoogle Scholar
  26. [26] Li X., Wang W., Hu X., Yang J., Selective kernel networks, Proceedings of the IEEE conference on computer vision and pattern recognition (2019) 510519.Google ScholarGoogle Scholar
  27. [27] Li X., Zhang Y., Cui Q., Yi X., Zhang Y., Tooth-marked tongue recognition using multiple instance learning and cnn features, IEEE Trans Cybern 49 (2018) 380387.Google ScholarGoogle Scholar
  28. [28] Lu J., Yang Z., Okkelberg K.Z., Ghovanloo M., Joint magnetic calibration and localization based on expectation maximization for tongue tracking, IEEE Trans Biomed Eng 65 (2018) 5263.Google ScholarGoogle Scholar
  29. [29] Ma J., Wen G., Wang C., Jiang L., Complexity perception classification method for tongue constitution recognition, Artif Intell Med 96 (2019) 123133.Google ScholarGoogle Scholar
  30. [30] Ma Y., Luo Y., Yang Z., Pcfnet: deep neural network with predefined convolutional filters, Neurocomputing 382 (2020) 3239.Google ScholarGoogle Scholar
  31. [31] Nalepa J., Lorenzo P.R., Marcinkiewicz M., Bobek-Billewicz B., Wawrzyniak P., Walczak M., et al., Fully-automated deep learning-powered system for dce-mri analysis of brain tumors, Artif Intell Med 102 (2020) 101769.Google ScholarGoogle Scholar
  32. [32] Nardelli P., Jimenez-Carretero D., Bermejo-Pelaez D., Washko G.R., Rahaghi F.N., Ledesma-Carbayo M.J., et al., Pulmonary artery-vein classification in ct images using deep learning, IEEE Trans Med Imaging 37 (2018) 24282440.Google ScholarGoogle Scholar
  33. [33] Nguyen T.V., Zhao Q., Yan S., Attentive systems: a survey, Int J Comput Vis 126 (2018) 86110.Google ScholarGoogle Scholar
  34. [34] Pang B., Zhang D., Li N., Wang K., Computerized tongue diagnosis based on bayesian networks, IEEE Trans Biomed Eng 51 (2004) 18031810.Google ScholarGoogle Scholar
  35. [35] Piantadosi G., Sansone M., Fusco R., Sansone C., Multi-planar 3d breast segmentation in mri via deep convolutional neural networks, Artif Intell Med 103 (2020) 101781.Google ScholarGoogle Scholar
  36. [36] Qureshi M.N.I., Oh J., Lee B., 3d-cnn based discrimination of schizophrenia using resting-state fmri, Artif Intell Med 98 (2019) 1017.Google ScholarGoogle Scholar
  37. [37] Savelli B., Bria A., Molinara M., Marrocco C., Tortorella F., A multi-context cnn ensemble for small lesion detection, Artif Intell Med 103 (2020) 101749.Google ScholarGoogle Scholar
  38. [38] Sebkhi N., Sahadat N., Hersek S., Bhavsar A., Siahpoushan S., Ghovanloo M., et al., A deep neural network-based permanent magnet localization for tongue tracking, IEEE Sens J (2019).Google ScholarGoogle Scholar
  39. [39] Selvaraju R.R., Cogswell M., Das A., Vedantam R., Parikh D., Batra D., Grad-cam: visual explanations from deep networks via gradient-based localization, International conference on computer vision (2017) 618626.Google ScholarGoogle Scholar
  40. [40] Shen Z.Y., Basic theory of traditional chinese medicine, Chin J Integr Tradit West Med 17 (1997) 643.Google ScholarGoogle Scholar
  41. [41] Simonyan K., Zisserman A., Very deep convolutional networks for large-scale image recognition, International conference on learning representations (2015).Google ScholarGoogle Scholar
  42. [42] Sun Y., Dai S., Li J., Zhang Y., Li X., Tooth-marked tongue recognition using gradient-weighted class activation maps, Fut Internet 11 (2019) 45.Google ScholarGoogle Scholar
  43. [43] Szegedy C., Vanhoucke V., Ioffe S., Shlens J., Wojna Z., Rethinking the inception architecture for computer vision, Proceedings of the IEEE conference on computer vision and pattern recognition (2016) 28182826.Google ScholarGoogle Scholar
  44. [44] Tang C., Dong X., Lian Y., Tang D., Do chinese hospital services constitute an oligopoly? Evidence of the rich-club phenomenon in a patient referral network, Fut Gener Comput Syst 105 (2020) 492501.Google ScholarGoogle Scholar
  45. [45] Wang X., Zhang B., Yang Z., Wang H., Zhang D., Statistical analysis of tongue images for feature extraction and diagnostics, IEEE Trans Image Process 22 (2013) 53365347.Google ScholarGoogle Scholar
  46. [46] Wang X., Zhang D., A new tongue colorchecker design by space representation for precise correction, IEEE J Biomed Health Inform 17 (2013) 381391.Google ScholarGoogle Scholar
  47. [47] Woo J., Murano E.Z., Stone M., Prince J.L., Reconstruction of high-resolution tongue volumes from mri, IEEE Trans Biomed Eng 59 (2012) 35113524.Google ScholarGoogle Scholar
  48. [48] Woo S., Park J., Lee J., Kweon I.S., Cbam: convolutional block attention module, European conference on computer vision (2018) 319.Google ScholarGoogle Scholar
  49. [49] Xie S., Girshick R.B., Dollar P., Tu Z., He K., Aggregated residual transformations for deep neural networks, Proceedings of the IEEE conference on computer vision and pattern recognition (2017) 59875995.Google ScholarGoogle Scholar
  50. [50] Xie Y., Xia Y., Zhang J., Song Y., Feng D., Fulham M., et al., Knowledge-based collaborative deep learning for benign-malignant lung nodule classification on chest ct, IEEE Trans Med Imaging 38 (2018) 9911004.Google ScholarGoogle Scholar
  51. [51] Xue Y., Li X., Cui Q., Wang L., Wu P., Cracked tongue recognition based on deep features and multiple-instance svm, Pacific rim conference on multimedia (2018) 642652.Google ScholarGoogle Scholar
  52. [52] Zhang B., Kumar B.V., Zhang D., Detecting diabetes mellitus and nonproliferative diabetic retinopathy using tongue color, texture, and geometry features, IEEE Trans Biomed Eng 61 (2014) 491501.Google ScholarGoogle Scholar
  53. [53] Zhang D., Zhang H., Zhang B., Introduction to tongue image analysis, Springer Singapore, Singapore, 2017, pp. 318. chapter 1.Google ScholarGoogle Scholar
  54. [54] Zhang X., Zhou X., Lin M., Sun J., Shufflenet: an extremely efficient convolutional neural network for mobile devices, Proceedings of the IEEE conference on computer vision and pattern recognition (2018) 68486856.Google ScholarGoogle Scholar
  55. [55] Zhou B., Khosla A., Lapedriza A., Oliva A., Torralba A., Learning deep features for discriminative localization, Proceedings of the IEEE conference on computer vision and pattern recognition (2016) 29212929.Google ScholarGoogle Scholar
  56. [56] Zhou C., Fan H., Li Z., Tonguenet: accurate localization and segmentation for tongue images using deep neural networks, IEEE Access 7 (2019) 148779148789.Google ScholarGoogle Scholar
  57. [57] Zhuo L., Zhang P., Qu P., Peng Y., Zhang J., Li X., A k-plsr-based color correction method for tcm tongue images under different illumination conditions, Neurocomputing 174 (2016) 815821.Google ScholarGoogle Scholar

Index Terms

(auto-classified)
  1. Fully-channel regional attention network for disease-location recognition with tongue images

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!