| |
Last updated on August 24, 2022. This conference program is tentative and subject to change
Technical Program for Wednesday July 13, 2022
|
WeAT2 |
Alsh-2 |
Theme 07. Physiological and Biological Sensing 2 |
Oral Session |
Chair: O'Mahony, Conor | Tyndall National Institute, University College Cork |
Co-Chair: Daniele, Michael | North Carolina State University / UNC Chapel Hill |
|
08:30-08:45, Paper WeAT2.1 | |
Optimization of a Stacked-Design Core-Body-Temperature Sensor for Long-Period Human Trials |
|
Matsunaga, Daichi | NTT Device Technology Labs |
Tanaka, Yujiro | NTT |
Tajima, Takuro | NTT Device Technology Laboratories |
Seyama, Michiko | NTT Device Technology Labs |
Keywords: Thermal sensors and systems, Wearable wireless sensors, motes and systems, Integrated sensor systems
Abstract: We fabricated a wearable sensor that can be attached to the skin surface and continuously measure core body temperature (CBT) wirelessly over a long period. CBT is calculated from skin-surface temperature and heat flux passing through the sensor. Since heat flux is lost to the surroundings of the probe, the slightest change in convection in daily life will degrade the measurement accuracy of the sensor. Accordingly, we previously proposed a heat-flux-path control structure to reduce the absolute amount of heat-flux loss. To make wearable sensors for long-term human trials, we proposed an integrated design in which a sensor probe, a circuit board, and a battery are stacked. We optimized the proposed design by computer simulation and evaluated the fabricated sensor by a phantom experiment in which the convectional state was changed. The evaluation results demonstrate that the sensor has limits of agreement (LOA) of [−0.13; 0.03]°C under 1-m/s-wind convection. Moreover, a preliminary human trial conducted under daily-life conditions (including convectional changes) demonstrated that the sensor has LOA of [−0.18; 0.22]°C. These results demonstrate that the fabricated sensor is suitable for CBT measurement.
|
|
08:45-09:00, Paper WeAT2.2 | |
Aptasensor for Detection of Influenza-A in Human Saliva |
|
Maddocks, Grace | NC State University |
Peterson, Kaila | North Carolina State University |
Downey, McKenna | Mldowney@ncsu.edu |
Lee, Banghyun | NC State University |
Lavoie, Joseph | NC State University |
Menegatti, Stefano | NC State University |
Daniele, Michael | North Carolina State University / UNC Chapel Hill |
Keywords: Chemo/bio-sensing - Chemical sensors and systems, Chemo/bio-sensing - Biological sensors and systems, Health monitoring applications
Abstract: Access to low-cost, rapid, individualized diagnostics at point-of-care and point-of-need is vital to minimize the impact of highly infectious viruses, such as influenza. Herein, a biosensor for detecting hemagglutinin (HA), an abundant capsid protein in H1N1 viruses, is demonstrated. A gold working electrode was functionalized with a thiol-modified, HA-binding aptamer derivatized with a methylene blue modification for redox reporting. The aptamer was characterized by surface plasmon resonance to confirm its biorecognition activity for HA. The aptasensor was characterized by square wave voltammetry to quantify the sensor’s response to varying concentrations of HA. The sensor exhibited a lower limit of detection of 1.5 pM with linear detection of up to 1.2 nM in both Tris buffer and simulated human saliva, thus encompassing the clinically relevant HA range in saliva. Average sensitivity was measured at 21.083 nA·nM-1 in Tris and 14.5 nA·nM-1 in artificial saliva across clinically relevant HA titers. Sensor stability across time was also investigated, providing a preliminary understanding of the translational viability of the aptasensors for mobile and remote diagnostic applications.
|
|
09:00-09:15, Paper WeAT2.3 | |
Split Electrodes for Electrical-Conductivity-Based Tissue Discrimination |
|
Yilmaz, Gurkan | CSEM SA |
Braun, Fabian | CSEM SA |
Adler, Andy | Carleton University |
Moreira de Souza, Antonio | CSEM SA |
Ferrario, Damien | CSEM |
Lemay, Mathieu | CSEM |
Chételat, Olivier | CSEM |
Keywords: Bio-electric sensors - Sensor systems, Bio-electric sensors - Sensing methods
Abstract: This work presents a method to minimize the inadvertent cutting of tissues in surgeries involving bone drilling. We present electrical impedance measurements as an assistive technology to image-guided surgery to achieve online guidance. Proposed concept is to identify and localize the landmarks via impedance measurements and then use this information to superimpose the estimated drilling trajectory on the offline maps obtained by pre-operative imaging. To this end, we propose an asymmetric electrode geometry, split electrodes, capable of distinguishing impedance variations as a function of rotation angle. The feasibility of the proposed approach is verified with numerical analysis. A probe with stainless steel electrodes has been fabricated and tested with a technical phantom. Although the results are impacted by a non-ideality in the phantom, we could show that the variation of impedance as a function of rotation angle can be used to localize the regions with different impedivities.
|
|
09:15-09:30, Paper WeAT2.4 | |
Analysis of Interface Material Noise in Non-Contact Capacitive Sensing |
|
Zhang, Yijing | Eindhoven University of Technology |
Xu, Lin | ShanghaiTech University |
Mischi, Massimo | Eindhoven University of Technology |
Cantatore, Eugenio | Eindhoven University of Technology |
Harpe, Pieter | Eindhoven University of Technology |
Keywords: Physiological monitoring - Modeling and analysis, Wearable sensor systems - User centered design and applications, New sensing techniques
Abstract: The thermal noise due to the resistivity of insulation materials can become a significant noise source in non-contact capacitive sensing, especially when measuring micro-volt-level physiological signals. Since both the impedance and the resistivity of practical insulation materials may be strongly frequency dependent, their thermal noise is often frequency dependent. This paper studies the impedance and noise behavior of different interface materials as function of frequency, by means of modelling, simulations, and experimental measurements. The results show that the inherent resistive noise of some fabrics (e.g., cotton, polyester) could outweigh the typical noise level of circuits for physiological sensing; and as a result, the interface noise can limit the quality of low-amplitude signal detection.
|
|
09:30-09:45, Paper WeAT2.5 | |
Development and Characterization of Passivation Methods for Microneedle-Based Biosensors |
|
Bocchino, Andrea | Tyndall National Institute |
Teixeira, Sofia Rodrigues | Tyndall National Institute, University College Cork |
Iadanza, Simone | Tyndall National Institute |
Melnik, Eva | AIT Austrian Institute of Technology GmbH, Center for Health And |
Kurzhals, Steffen | AIT Austrian Institute of Technology GmbH, Center for Health And |
Mutinati, Giorgio | AIT Austrian Institute of Technology GmbH, Center for Health And |
O'Mahony, Conor | Tyndall National Institute, University College Cork |
Keywords: Chemo/bio-sensing - Chemical sensors and systems, Bio-electric sensors - Sensor systems, Chemo/bio-sensing - Biological sensors and systems
Abstract: Microneedles (MN) are short, sharp structures that have the ability to painlessly pierce the stratum corneum, the outermost layer of the skin, and interface with the dermal interstitial fluid that lies beneath. Because the interstitial fluid is rich in biomarkers, microneedle-based biosensors have the potential to be used in a wide range of diagnostic applications. To act as an electrochemical sensor, the tip or the body of the MN must be functionalized, while the substrate areas are generally passivated to block any unwanted background interference that may occur outside of the skin. This work presents four different passivation techniques, based on the application of SiO2, polymethyl methacrylate (PMMA), an adhesive film, and varnish to the substrate areas. Optical, SEM and electrochemical measurements were performed to quantitatively assess the performance of each film. The data shows that whilst manual application of varnish provided the highest level of electrical isolation, the spin-coating of a 5 m thick layer of PMMA is likely to provide the best combination of performance and manufacturability.
|
|
09:45-10:00, Paper WeAT2.6 | |
A Fully Integrated CMOS-Controlled Scalable Microfluidics and Pneumatic-Free Cell Actuation and Cytometry Sensing Device |
|
Zhu, Chengjie | Princeton University |
Maldonado, Jesus | Princeton University |
Sengupta, Kaushik | Princeton University |
Keywords: Chemo/bio-sensing - Micrototal analysis and lab-on-chip systems, Chemo/bio-sensing - Biological sensors and systems, Bio-electric sensors - Sensor systems
Abstract: Moore's law has enabled massive scaling of complex computing and sensing systems in modern-day chip-scale architectures allowing extremely high yield and system complexity at very low-cost. Exploiting such Moore's law, we explore silicon-based integrated circuits and chip-scale systems to interface with biological fluids to manipulate, sense, and detect cells in real-time for an end-to-end low cost, miniaturized, and high sensitivity point-of-care diagnostics platform. Eliminating the need for complex, expensive, large and bulky syringe pumps and optical-based cytometers, the proposed system allows pneumatic-free AC electro-osmosis bulk fluid driving capabilities controlled by the CMOS chip, and integrated dielectrophoretic cell actuation with 2μm focusing accuracy, impedance spectroscopy sensing, and separation capabilities. The paper presents, for the first-time, a CMOS-driven cellular sensing platform for microfluidics that can be translated to a wide range of biomedical applications.
|
|
WeAT3 |
Boisdale-1 |
Theme 01. Deep Learning and Neural Network Methods for Cardiac Signals |
Oral Session |
Chair: Shi, Xintong | Keio University |
Co-Chair: Chen, Wuxia | The University of Texas at San Antonio |
|
08:30-08:45, Paper WeAT3.1 | |
Effective Data Augmentation, Filters, and Automation Techniques for Automatic 12-Lead ECG Classification Using Deep Residual Neural Networks |
|
An, Junmo | Philips |
Gregg, Richard | Philips Healthcare |
Borhani, Soheil | Philips |
Keywords: Data mining and big data methods - Biosignal classification, Data mining and big data methods - Machine learning and deep learning methods, Neural networks and support vector machines in biosignal processing and classification
Abstract: Automatic electrocardiogram (ECG) analysis plays a critical role in early detection and diagnosis of cardiac abnormalities and diseases. Data augmentation and automation strategies have been proposed to enhance the robustness of the machine and deep learning model for the classification of cardiac abnormalities. Here we propose 15 data augmentation and 6 filters, and an automation method using an end-to-end deep residual neural network (ResNet) model for automatic cardiac abnormalities detection from 12-lead ECG recordings. We evaluate the effectiveness of data augmentation/filtering and automation techniques using the proposed ResNet-based model on the China Physiological Signal Challenge (CPSC) dataset consisting of 9 diagnostic classes. The average F1 scores across 9 classes on the CPSC dataset trained with three data augmentation (baseline wander addition, dropout, and scaling) and a filter (sigmoid compression) were significantly higher than that without using augmentation/filters (baseline). The highest average F1 score with sigmoid compression method was significantly higher (relative improvement of 2.04 %) than the baseline while horizontal and vertical flipping augmentations were detrimental to the classification performance. Additionally, the results show that the random combination of four selected data augmentation and filter using the modified RandAugment technique provided a significantly higher average F1 score (relative improvement of 2.54 %) compared to the baseline. The proposed data augmentation, filters, and automation technique provide an effective solution to improve the classification performance of the end-to-end deep learning model from ECG recordings without changing the model hyperparameters and structure.
|
|
08:45-09:00, Paper WeAT3.2 | |
Multiclass Convolutional Neural Networks for Atrial Fibrillation Classification |
|
Sbrollini, Agnese | Università Politecnica Delle Marche |
Tomassini, Selene | Marche Polytechnic University |
Emaldi, Enrico | Marche Polytechnic University |
Marcantoni, Ilaria | Università Politecnica Delle Marche |
Morettini, Micaela | Università Politecnica Delle Marche |
Dragoni, Aldo Franco | Marche Polytechnic University |
Burattini, Laura | Università Politecnica Delle Marche |
Keywords: Data mining and big data methods - Machine learning and deep learning methods, Data mining and big data methods - Biosignal classification, Time-frequency and time-scale analysis - Wavelets
Abstract: Atrial fibrillation (AF) is a common supraventricular arrhythmia. Its automatic identification by standard 12-lead electrocardiography (ECG) is still challenging. Recently, deep learning provided new instruments able to mimic the diagnostic ability of clinicians but only in case of binary classification (AF vs. normal sinus rhythm-NSR). However, binary classification is far from the real scenarios, where AF has to be discriminated also from several other physiological and pathological conditions. The aim of this work is to present a new AF multiclass classifier based on a convolutional neural network (CNN), able to discriminate AF from NSR, premature atrial contraction (PAC) and premature ventricular contraction (PVC). Overall, 2796 12-lead ECG recordings were selected from the open-source “PhysioNet/Computing in Cardiology Challenge 2021” database, to construct a dataset constituted by four balanced classes, namely AF class, PAC class, PVC class, and NSR class. Each lead of each ECG recording was decomposed into spectrogram by continuous wavelet transform and saved as 2D grayscale images, used to feed a 6-layers CNN. Considering the same CNN architecture, a multiclass classifiers (all classes) and three binary classifiers (AF class, PAC class, and PVC class vs. NSR class) were created and validated by a stratified shuffle split cross-validation of 10 splits. Performance was quantified in terms of area under the curve (AUC) of the receiver operating characteristic. Multiclass classifier performance was high (AF class: 96.6%; PAC class: 95.3%; PVC class: 92.8%; NSR class: 97.4%) and preferable to binary classifiers. Thus, our CNN AF multiclass classifier proved to be an efficient tool for AF discrimination from physiological and pathological confounders.
|
|
09:00-09:15, Paper WeAT3.3 | |
Residual Convolutional Autoencoder Combined with a Non-Negative Matrix Factorization to Estimate Fetal Heart Rate |
|
Lafaye de Micheaux, Hugo | Univ. Grenoble Alpes, CNRS, CHU Grenoble Alpes, Grenoble INP, TI |
Resendiz, Mariel | Univ. Grenoble Alpes, CNRS, CHU Grenoble Alpes, Grenoble INP, TI |
Rivet, Bertrand | Grenoble Universities |
Fontecave-Jallon, Julie | Univ. Grenoble Alpes, CNRS, CHU Grenoble Alpes, Grenoble INP |
Keywords: Neural networks and support vector machines in biosignal processing and classification
Abstract: The fetal heart rate (fHR) plays an important role in the determination of the good health of the fetus. Beside the traditional Doppler ultrasound technique, non-invasive fetal electrocardiography (fECG) has become an interesting alternative. However, extracting clean fECG from abdominal ECG (aECG) recordings is a challenging task due to the presence of the maternal ECG component and various noise sources. In this context, we propose a deep residual convolutional autoencoder network trained on synthetic aECG simulations followed by a transfer learning phase on real aECG recordings to extract the cleanest fECG. Afterwards, we propose to use a non-negative matrix factorization based approach on the obtained fECG to estimate the fHR. Our method is evaluated on three publicly available databases demonstrating that it can provide significant performance improvement against comparative methodologies.
|
|
09:15-09:30, Paper WeAT3.4 | |
Non-Invasive Fetal ECG Signal Quality Assessment Based on Unsupervised Learning Approach |
|
Shi, Xintong | Keio University |
Yamamoto, Kohei | Keio University |
Ohtsuki, Tomoaki | Keio University |
Matsui, Yutaka | Atom Medical Corporation |
Ohwada, Kazunari | Atom Medical Co Ltd |
Keywords: Neural networks and support vector machines in biosignal processing and classification, Data mining and big data methods - Machine learning and deep learning methods
Abstract: The non-invasive fetal electrocardiogram (FECG) derived from abdominal surface electrodes has been widely used for fetal heart rate (FHR) monitoring to assess fetal well-being. However, the accuracy of FECG-based FHR estimation heavily depends on the quality of FECG signal itself, which can generally be affected by several interference sources such as maternal heart activities and fetal movements. Hence, FECG signal quality assessment (SQA) is an essential task to improve the accuracy of FHR estimation by removing or interpolating low-quality FECG signals. In recent research, various SQA methods based on supervised learning have been proposed. Although these methods could perform accurate SQA, they require large labeled datasets. Nevertheless, the labeled datasets for the FECG SQA are very limited. In this paper, to address this limitation, we propose an unsupervised learning-based SQA method for identifying high and low-quality FECG signal segments. Specifically, a fully convolutional network (FCN)-based autoencoder (AE) is trained for reconstructing a spectrogram derived from FECG. An AE-based feature related to reconstruction error is then calculated to identify high and low-quality FECG segments. In addition, entropy-based features, statistical features, and ECG signal quality indices (SQIs) are also extracted. The high and low-quality segments are identified by feeding the extracted features into self-organizing map (SOM). The experimental results showed that our proposal achieved an accuracy of 98% in high and low-quality signal classification.
|
|
09:30-09:45, Paper WeAT3.5 | |
A Meta-Transfer Learning Approach to ECG Arrhythmia Detection |
|
Chen, Wuxia | The University of Texas at San Antonio |
Banerjee, Taposh | University of Texas at San Antonio |
John, Eugene | The University of Texas at San Antonio |
Keywords: Neural networks and support vector machines in biosignal processing and classification, Signal pattern classification
Abstract: Automatic classification of cardiac abnormalities is becoming increasingly popular with the prevalence of ECG recordings. Many signal processing and machine learning algorithms have shown the potential to identify cardiac abnormalities accurately. However, most of these methods heavily rely on a large amount of relatively homogeneous datasets. In real life, chances are that there is not enough data for a specific category, and regular deep learning may fail in this scenario. A straightforward intuition is to use the knowledge learned from previous data to solve the problem. This idea leads to learning-to-learn: extrapolating the knowledge accumulated from the old dataset and using it in a different but somewhat related dataset. In this way, we do not need to have considerable data to learn the new task because the underlying features of the old and new datasets resemble one another. In this paper, a novel machine learning method is introduced to solve the ECG arrhythmia detection problem with a limited amount of data. The proposed method combines the popular techniques of meta-learning and transfer learning. It is shown that our method achieves much higher accuracy in ECG arrhythmia classification with a few data and learns the new task faster than regular deep learning.
|
|
WeAT4 |
Boisdale-2 |
Theme 01. Signal Processing and Classification for Contactless and Wearable
Systems |
Oral Session |
Chair: Heydari Beni, Nargess | University of Waterloo |
|
08:30-08:45, Paper WeAT4.1 | |
Tiny CNN for Seizure Prediction in Wearable Biomedical Devices |
|
Zhang, Yang | Polytechnique Montreal |
Savaria, Yvon | Ecole Polytechnique De Montreal |
Zhao, Shiqi | Westlake University |
Mordido, Gonçalo | Polytechnique Montreal |
Sawan, Mohamad | Westlake University |
Leduc-Primeau, François | Polytechnique Montreal |
Keywords: Neural networks and support vector machines in biosignal processing and classification, Time-frequency and time-scale analysis - Time-frequency analysis, Signal pattern classification
Abstract: Epilepsy is a life-threatening disease affecting millions of people all over the world. Artificial intelligence epileptic predictors offer excellent potential to improve epilepsy therapy. Particularly, deep learning models such as convolutional neural networks (CNN) can be used to accurately detect ictogenesis through deep structured learning representations. In this work, a tiny one-dimensional stacked convolutional neural network (1DSCNN) is proposed based on short-time Fourier transform (STFT) to predict epileptic seizure. The results demonstrate that the proposed method obtains better performance compared to recent state-of-the-art methods, achieving an average sensitivity of 94.44%, average false prediction rate (FPR) of 0.011/h and average area under the curve (AUC) of 0.979 on the test set of the American Epilepsy Society Seizure Prediction Challenge dataset, while featuring a model size of only 21.32kB. Furthermore, after adapting the model to 4-bit quantization, its size is significantly decreased by 7.08x with only 0.51% AUC score precision loss, which shows excellent potential for hardware-friendly wearable implementation.
|
|
08:45-09:00, Paper WeAT4.2 | |
Lightweight Neural Network Based Model for Real-Time Precise HR Monitoring During High Intensity Workout Using Consumer Smartwatches |
|
Fedorin, Illia | Samsung R&D Institute Ukraine |
Pohribnyi, Vitalii | Samsung R&D Institute Ukraine |
Sverdlov, Denys | Samsung R&D Institute Kyiv, Ukraine |
Krasnoshchok, Illia | Samsung R&D Institute Ukraine |
Keywords: Neural networks and support vector machines in biosignal processing and classification, Signal pattern classification, Time-frequency and time-scale analysis - Time-frequency analysis
Abstract: Mobile and wearable electronics is one of the rapidly developing areas of high technologies, which regularly appear new devices that offer new features for monitoring our health, level of physical exertion and everyday activity. From the point of view of medicine and sports, usually wearable devices are used to solve one of two tasks: round-the-clock continuous heart rate monitoring, and the control of the heart rate during workout. One of the drawbacks of such devices is the calculation errors that are caused by the motion artifacts. The most critical of them are those that arise in some cases when performing high-intensity training's, which are conjugate with intense hands movements, such as sprint running or boxing. In addition, limited resources of wearable devices do not always allow usage of very complex algorithms for processing certain events. In our work, an ultra-lightweight framework for a precise real-time heart rate monitoring during the high intensity physical exercises is developed. The model is a combination of a digital signal processing and deep convolutional and recurrent neural networks approaches. From the personalization point of view, the effect of anthropometric parameters has been studied. The model shows an average mean absolute error of 2.4 ± 2.8 bpm during 5 fold cross-validation on an internal dataset, 2.9 ± 3.4 bpm when evaluated on 12 IEEE SPC subjects, and 4.8 ± 5.3 bpm when evaluated on 24 subjects from Wonkwang University dataset. Such results exceed the current state of the art solutions both in terms of the achieved accuracy of heart rate estimation and consumed computational resources.
|
|
09:00-09:15, Paper WeAT4.3 | |
Heartbeat Detection from the Upper Arm Using an SWT-Based Zero-Phase Filter Bank Incorporated with a Voting Scheme |
|
Heydari Beni, Nargess | University of Waterloo |
Jiang, Ning | University of Waterloo |
Keywords: Time-frequency and time-scale analysis - Wavelets
Abstract: Electrocardiogram (ECG) signal provides a graphical representation of cardiac activity and is the most commonly adopted clinical tool for cardiac abnormalities detection. Heartbeat detection, as the first step in analyzing ECG signals, is required for an accurate diagnosis. Stationary wavelet transform (SWT) as a commonly used algorithm for heartbeat detection has a disadvantage of phase shift regarding the original signal. This work addresses this issue by presenting a new method that incorporates an SWT-based zero-phase filter bank with a voting scheme. Our results indicated that a superior performance in heartbeat detection was achieved from the upper arm compared to conventional SWT with a more accurate localization. We achieved sensitivity (SE) and positive predictive value (PPV) of 0.98 ± 0.04 and 0.95 ± 0.09 with the most distance of 50 ms from the actual heartbeats. The SE and PPV changed to 0.75± 0.15 and 0.73± 0.16, respectively for the distance of 20 ms.
|
|
09:15-09:30, Paper WeAT4.4 | |
A Novel Method for the Extraction of Fetal ECG Signals from Wearable Devices |
|
Chowdhury, Shayan | New York State Psychiatric Institute, Columbia University Medica |
Frasch, Martin | University of Washington |
Lucchini, Maristella | Columbia University Irving Medical Center |
Shuffrey, Lauren C. | Columbia University Medical Center |
Sania, Ayesha | Department of Psychiatry and Pediatrics, Columbia University Col |
Malette, Chanel | New York State Psychiatric Institute |
Odendaal, Hein | Stellenbosch University |
Myers, Michael | Columbia University Medical Center |
Fifer, William P. | Department of Psychiatry and Pediatrics, Columbia University Col |
Pini, Nicolò | Columbia University Irving Medical Center |
Keywords: Signal pattern classification, Nonlinear dynamic analysis - Biomedical signals, Time-frequency and time-scale analysis - Nonstationary analysis and modeling
Abstract: The role of fetal surveillance for the prediction and timely assessment of fetal distress is widely established. Fetal ECG (fECG) monitoring via wearable devices is a feasible solution for performing continuous monitoring of fetal wellbeing and it has seen a net increase in popularity in recent years. In this paper, we propose a novel adaptation of the Smart AdaptiVe Ecg Recognition (SAVER) algorithm for the detection of fECG in long-duration recordings acquired in clinical as well as unconventional settings. The methodology was trained and tested on 50 recordings of duration 1 hour (59.33±5.54 min) obtained using the Monica AN24 fetal monitor. We validated the performance against the automatic extraction performed by the Monica DK software. Our results show superior reliability of the proposed methodology in extracting fECG and associated estimates of fetal heart rate (fHR).
|
|
09:30-09:45, Paper WeAT4.5 | |
Design of a Realtime Photoplethysmogram Signal Quality Checker for Wearables and Edge Computing |
|
Banerjee, Tanushree | TCS Innovation Lab |
Gavas, Rahul | TCS Research and Innovation, Tata Consultancy Services Ltd |
BASARALU SHESHACHALA, MITHUN | Tata Consultancy Services |
Karmakar, Somnath | Tata Consultancy Services Limited |
Ramakrishnan, Ramesh Kumar | TATA Consultancy Services |
Pal, Arpan | Tata Consultancy Services |
Keywords: Time-frequency and time-scale analysis - Time-frequency analysis, Physiological systems modeling - Signal processing in physiological systems, Data mining and big data methods - Pattern recognition
Abstract: Photoplethysmogram (PPG) signal is extensively used for deducing health parameters of patients in order to infer about physiological conditions of heart, blood pressure, respiratory patterns, and so on. Such analysis and estimations can be done accurately only on high quality PPG signals with very minimal artifacts. PPG signals collected from fitness grade and smart phone scenarios are prone to muscle artifacts and hence there is a need to assess the signal quality before using the signal. Although there are approaches available in the realm of machine learning and deep learning, they are computationally expensive and may not be suitable for a wearable or edge computing scenario. In this paper, we propose the design of a quality checker to check the quality of the signal that can be directly implemented on edge devices like smartwatch. The algorithm is tested on PPG data collected from wearable, ICU and medical grade devices. In the wearable scenario where the noise levels are very high, our algorithm has performed significantly better with a Fscore of over 0.92. Further we show that by applying the proposed quality checker, the accuracy of the computed heart rate from a smart phone PPG-application significantly improves.
|
|
09:45-10:00, Paper WeAT4.6 | |
Deep Learning Based Non-Contact Physiological Monitoring in Neonatal Intensive Care Unit |
|
Sahoo, Nicky Nirlipta | Indian Institute of Technology, Madras |
Murugesan, Balamurali | Indian Institute of Technology Madras |
Das, Ayantika | Indian Institute of Technology, Madras |
Karthik, Srinivasa | HTIC IIT Madras |
Sirukarumbur Shanmugaram, Keerthi Ram | IIT Madras |
Leonhardt, Steffen | RWTH Aachen University |
Joseph, Jayaraj | HTIC, Indian Institute of Technology Madras |
Sivaprakasam, Mohanasankar | Indian Institute of Technology Madras |
Keywords: Data mining and big data methods - Machine learning and deep learning methods, Data mining and big data methods - Biosignal classification, Neural networks and support vector machines in biosignal processing and classification
Abstract: Preterm babies in the Neonatal Intensive Care Unit (NICU) have to undergo continuous monitoring of their cardiac health. Conventional monitoring approaches are contact-based, making the neonates prone to various nosocomial infections. Video-based monitoring approaches have opened up potential avenues for contactless measurement. This work presents a pipeline for remote estimation of cardiopulmonary signals from videos in NICU setup. We have proposed an end-to-end deep learning (DL) model that integrates a non-learning-based approach to generate surrogate ground truth (SGT) labels for supervision, thus refraining from direct dependency on true ground truth labels. We have performed an extended qualitative and quantitative analysis to examine the efficacy of our proposed DL-based pipeline and achieved an overall average mean absolute error of 4.6 beats per minute (bpm) and root mean square error of 6.2 bpm in the estimated heart rate.
|
|
WeAT5 |
Carron -1 |
Theme 10. General and Theoretical Informatics I |
Oral Session |
Co-Chair: Ganapathy, Nagarajan | Indian Institute of Technology Madras |
|
08:30-08:45, Paper WeAT5.1 | |
ATLAS: An Adaptive Transfer Learning Based Pain AssessmentSystem: A Real Life Unsupervised Pain Assessment Solution |
|
Fang, Ruijie | University of California, Davis |
Zhang, Ruoyu | University of California Davis |
Hosseini, Elahe | University of California, Davis |
Hosseini, Sayed Mohammad | Osprii LLC |
Faghih, Mahya | Johns Hopkins Medical Institution |
Orooji, Mahdi | University of California Davis |
Rafatirad, Soheil | UCLA |
Rafatirad, Setareh | Assistant Professor |
Homayoun, Houman | University of California Davis |
Keywords: General and theoretical informatics - Algorithms, Health Informatics - Informatics for chronic disease management, General and theoretical informatics - Data intelligence
Abstract: Undertreatment or overtreatment of pain will cause severe consequences physiologically and psychologically. Thus, researchers have made great efforts to develop automatic pain assessment approaches based on physiological signals using machine learning techniques. However, state-of-art research mainly focuses on verifying the hypothesis that physiological signals can be used to assess pain. The critical assumption of these studies is that training data and testing data have the same distribution. However, this assumption may not hold in real-life scenarios, for instance, the adoption of machine learning model by a new patient. Such real-life scenarios in which user's data is unlabeled is largely neglected in literature. This study compensates for the rift by proposing an adaptive transfer learning based pain assessment system (ATLAS), a novel adaptive learning system based on the transfer learning algorithm Transfer Components Analysis (TCA) to minimize the distance between training data and unlabeled testing data. Experiments were conducted on BioVid database, and the results showed our approach outperforms three existing traditional machine learning-based approaches and achieves an accuracy just 2.0% below the accuracy with labeled data.
|
|
08:45-09:00, Paper WeAT5.2 | |
Utilizing Deep Learning on Limited Mobile Speech Recordings for Detection of Obstructive Pulmonary Disease |
|
Nathan, Viswam | Samsung Research America Inc |
Vatanparvar, Korosh | Samsung Research America |
Chun, Keum San | The University of Texas at Austin |
Kuang, Jilong | Samsung Research America |
Keywords: General and theoretical informatics - Artificial Intelligence, Sensor Informatics - Physiological monitoring, Health Informatics - Mobile health
Abstract: Passive assessment of obstructive pulmonary disease has gained substantial interest over the past few years in the mobile and wearable computing communities. One of the promising approaches is speech-based pulmonary assessment wherein spontaneous or scripted speech is used to evaluate an individual's pulmonary condition. Recent approaches in this regard heavily rely on accurate speech activity segmentation and specific, hand-crafted features. In this paper, we present an end-to-end deep learning approach for detecting obstructive pulmonary disease. We leveraged transfer learning using a network pre-trained for a different audio-based task, and employed our own additional shallow network on top as a binary classifier to indicate if a given speech recording belongs to an asthma or COPD patient. The additional network was a fully connected neural net with 2 hidden layers, and this was evaluated on two real-world datasets. We demonstrated that the system can identify subjects with obtructive pulmonary disease using their speech with 88.3% precision, 88.8% recall and 88.3% F-1 score using 10-fold cross-validation. The system showed improved performance in identifying the most severely affected subgroup of patients in the dataset, with an average 93.6% accuracy.
|
|
09:00-09:15, Paper WeAT5.3 | |
CoughLIME: Sonified Explanations for the Predictions of COVID-19 Cough Classifiers |
|
Akman, Alican | Imperial College London |
Schuller, Bjoern | Imperial College London |
Wullenweber, Anne | Technical University of Munich |
Keywords: General and theoretical informatics - Artificial Intelligence, General and theoretical informatics - Machine learning, Health Informatics - eHealth
Abstract: Since the emergence of the COVID-19 pandemic, various methods to detect the illness from cough and speech audio data have been proposed. While many of them deliver promising results, they lack transparency in the form of explanations which is crucial for establishing trust in the classifiers. We propose CoughLIME which extends LIME to explanations for audio data, specifically tailored towards cough data. We show that CoughLIME is capable of generating faithful sonified explanations for COVID-19 detection. To quantify the performance of the explanations generated for the CIdeR model, we adopt pixel flipping to audio and introduce a novel metric to assess the performance of the XAI classifier. CoughLIME achieves a DeltaAUC of 19.48,% generating explanations for CIdeR's predictions.
|
|
09:15-09:30, Paper WeAT5.4 | |
Phenotypes Based Classification of Blood-Brain-Barrier Drugs Using Feature Selection Methods and Extreme Gradient Boosting |
|
SUBHA RAMAKRISHNAN, MANUSKANDAN | Anna University |
Ganapathy, Nagarajan | Indian Institute of Technology Madras |
Keywords: General and theoretical informatics - Artificial Intelligence, Health Informatics - Precision medicine, General and theoretical informatics - Knowledge modeling
Abstract: In this work, an attempt has been made to discriminate drug with blood brain barrier (BBB) permeability using clinical phenotypes and extreme gradient boosting (XGBoost) methods. For this, the drug name and their clinical phenotypes namely side effects and indications are obtained from public available database. Prominent clinical phenotypes are selected using genetic algorithm and binary particle swarm optimization. Four machine algorithms namely k-Nearest Neighbours, support vector machines, rotation forest and XGBoost are used for classification of BBB drugs. The result show that the proposed clinical phenotypes based features are able to distinguish drugs with BBB permeability. The maximum number of clinical phenotypes (69%) is reduced by BPSA and GA for classification. The XGBoost method is found to be most accurate (F1=89.7%) is discriminating drugs with BBB permeability. The proposed approach are found to be capable of handling multi-parametric characteristics of the drugs. Particularyl, the combination of XGBoost with combination of side effects and indications could be used for precision medicine applications
|
|
09:30-09:45, Paper WeAT5.5 | |
Creating Computer Vision Models for Respiratory Status Detection |
|
Do, Quan | Mayo Clinic |
Chaudri, Jamil | Marshall University |
|
|
09:45-10:00, Paper WeAT5.6 | |
PsmPy: A Package for Retrospective Cohort Matching in Python |
|
Kline, Adrienne | Northwestern University |
Luo, Yuan | Northwestern University |
Keywords: General and theoretical informatics - Causality analysis and case-based reasoning, Health Informatics - Outcome research, Public Health Informatics - Epidemiological modeling
Abstract: Propensity score matching (PSM) is a technique used in retrospective investigation of cohort matching as an alternative approach to the prospective matching that is typically used by a randomized control trial (RCT). The process of selecting untreated cases that are the best match tothe treated cases is the focus of this research. We created aPSM package for the python environment, termed PsmPy, to carry out this task. The PsmPy package debuted and proposed here is based on a logistic regression logit score where a match is selected using k-nearest neighbors (k-NN). Additional plotting and arguments are available to the user and are also described. To benchmark our method, we compared it with the existing R package, MatchIt, and evaluated our covariates’ residual effect sizes with respect to the treatment condition before and after matching. Using a Mann-Whitney statistical test, we showed that our method significantly outperformed MatchIt in cohort matching (U=49, p<0.0001)when comparing residual effect sizes of the covariates. The PsmPy demonstrated a 10-fold average improvement in residual effect sizes amongst covariates when compared with the package MatchIt, suggesting that it is a viable alternative for use in propensity matching studies
|
|
WeAT6 |
Carron-2 |
Theme 10. General and Theoretical Informatics - Data Privacy |
Oral Session |
|
08:30-08:45, Paper WeAT6.1 | |
Privacy-Preserving Model Training for Disease Prediction Using Federated Learning with Differential Privacy |
|
Khanna, Amol | Johns Hopkins University |
Schaffer, Vincent | Yale University |
Gursoy, Gamze | University of Illinois at Chicago |
Gerstein, Mark | Yale University |
Keywords: General and theoretical informatics - Data privacy, General and theoretical informatics - Machine learning, General and theoretical informatics - Security and authentication
Abstract: Machine learning is playing an increasingly critical role in health science with its capability of inferring valuable information from high-dimensional data. More training data provides greater statistical power to generate better models that can help decision-making in healthcare. However, this often requires combining research and patient data across institutions and hospitals, which is not always possible due to privacy considerations. In this paper, we outline a simple federated learning algorithm implementing differential privacy to ensure privacy when training a machine learning model on data spread across different institutions. We tested our model by predicting breast cancer status from gene expression data. Our model achieves a similar level of accuracy and precision as a single-site non-private neural network model when we enforce privacy. This result suggests that our algorithm is an effective method of implementing differential privacy with federated learning, and clinical data scientists can use our general framework to produce differentially private models on federated datasets. Our framework is available at https://github.com/gersteinlab/idash20FL.
|
|
08:45-09:00, Paper WeAT6.2 | |
Fair and Privacy-Preserving Alzheimer’s Disease Diagnosis Based on Spontaneous Speech Analysis Via Federated Learning |
|
Meerza, Syed Irfan Ali | University of Tennessee Knoxville |
Li, Zhuohang | University of Tennessee, Knoxville |
Liu, Luyang | Google Research |
Zhang, Jiaxin | Oak Ridge National Laboratory |
Liu, Jian | University of Tennessee, Knoxville |
Keywords: General and theoretical informatics - Data privacy, General and theoretical informatics - Machine learning, Health Informatics - Behavioral health informatics
Abstract: As the most common neurodegenerative disease among older adults, Alzheimer's disease (AD) would lead to loss of memory, impaired language and judgment, gait disorders, and other cognitive deficits severe enough to interfere with daily activities and significantly diminish quality of life. Recent research has shown promising results in automatic AD diagnosis via speech, leveraging the advances of deep learning in the audio domain. However, most existing studies rely on a centralized learning framework which requires subjects' voice data to be gathered to a central server, raising severe privacy concerns. To resolve this, in this paper, we propose the first federated-learning-based approach for achieving automatic AD diagnosis via spontaneous speech analysis while ensuring the subjects' data privacy. Extensive experiments under various federated learning settings on the ADReSS challenge dataset show that the proposed model can achieve high accuracy for AD detection while achieving privacy preservation. To ensure fairness of the model performance across clients in federated settings, we further deploy fair aggregation mechanisms, particularly q-FEDAvg and q-FEDSgd, which greatly reduces the algorithmic biases due to the data heterogeneity among the clients.
|
|
09:00-09:15, Paper WeAT6.3 | |
ECG Biosignal Deidentification Using Conditional Generative Adversarial Networks |
|
JAFARLOU, SALAR | University of California, Irvine |
Rahmani, Amir M. | Department of Computer Science, University of California Irvine, |
Dutt, Nikil | University of California, Irvine |
Rahimi Mousavi, Sanaz | California State University, Dominguez Hills |
Keywords: General and theoretical informatics - Data privacy, General and theoretical informatics - Deep learning and big data to knowledge, Sensor Informatics - Physiological monitoring
Abstract: Electrocardiogram (ECG) signals provide rich information on individuals' potential cardiovascular conditions and disease, ranging from coronary artery disease to the risk of a heart attack. While health providers store and share these information for medical and research purposes, such data is highly vulnerable to privacy concerns, similar to many other types of healthcare data. Recent works have shown the feasibility of identifying and authenticating individuals by using ECG as a biometric due to the highly individualized nature of ECG signals. However, to the best of our knowledge, there does not exist a method in the literature attempting to de-identify ECG signals. In this paper, to address this privacy protection gap, we propose a Generative Adversarial Network (GAN)-based framework for de-identification of ECG signals. We leverage a combination of a standard GAN loss, an Ordinary Differential Equations (ODE)-based, and identity-based loss values to train a generator that de-identifies a ECG signal while preserving structure the ECG signal and information regarding the target cardio vascular condition. We evaluate our framework in terms of both qualitative and quantitative metrics considering different weightings over the above-mentioned losses. Our experiments demonstrate the efficiency of our framework in terms of privacy protection and ECG signal structural preservation.
|
|
09:15-09:30, Paper WeAT6.4 | |
Privacy-Preserving Speech-Based Depression Diagnosis Via Federated Learning |
|
Cui, Yue | University of Tennessee |
Li, Zhuohang | University of Tennessee, Knoxville |
Liu, Luyang | Google Research |
Zhang, Jiaxin | Oak Ridge National Laboratory |
Liu, Jian | University of Tennessee, Knoxville |
Keywords: General and theoretical informatics - Data privacy, General and theoretical informatics - Machine learning, Health Informatics - Behavioral health informatics
Abstract: Mental health disorders, such as depression, affect a large and growing number of populations worldwide, and they may cause severe emotional, behavioral and physical health problems if left untreated. As depression affects a patient's speech characteristics, recent studies have proposed to leverage deep-learning-powered speech analysis models for depression diagnosis, which often require centralized learning on the collected voice data. However, this centralized training requiring data to be stored at a server raises the risks of severe voice data breaches, and people may not be willing to share their speech data with third parties due to privacy concerns. To address these issues, in this paper, we demonstrate for the first time that speech-based depression diagnosis models can be trained in a privacy-preserving way using federated learning, which enables collaborative model training while keeping the private speech data decentralized on clients' devices. To ensure the model's robustness under attacks, we also integrate different FL defenses into the system, such as norm bounding, differential privacy, and secure aggregation mechanisms. Extensive experiments under various FL settings on the DAIC-WOZ dataset show that our FL model can achieve high performance without sacrificing much utility compared with centralized-learning approaches while ensuring users' speech data privacy.
|
|
09:30-09:45, Paper WeAT6.5 | |
A Novel Large Structured Cardiotocographic Database |
|
Spairani, Edoardo | University of Pavia |
Daniele, Beniamino | Politecnico Di Milano |
Signorini, Maria G. | Politecnico Di Milano |
Magenes, Giovanni | University of Pavia |
|
|
09:45-10:00, Paper WeAT6.6 | |
Generative Moment Matching Networks for Genotype Simulation |
|
Perera, Maria | Universitat Politècnica De Catalunya |
Mas Montserrat, Daniel | Stanford University |
Barrabes, Miriam | UPC |
Geleta, Margarita | UC Irvine |
Giro-i-Nieto, Xavier | Universitat Politecnica De Catalunya |
Ioannidis, Alexander | Stanford University |
Keywords: General and theoretical informatics - Deep learning and big data to knowledge, General and theoretical informatics - Machine learning, General and theoretical informatics - Data privacy
Abstract: The generation of synthetic genomic sequences using neural networks has potential to ameliorate privacy and data sharing concerns and to mitigate potential bias within datasets due to under-representation of some population groups. However, there is not a consensus on which architectures, training procedures, and evaluation metrics should be used when simulating single nucleotide polymorphism (SNP) sequences with neural networks. In this paper, we explore the use of Generative Moment Matching Networks (GMMNs) for SNP simulation, we present some architectural and procedural changes to properly train the networks, and we introduce an evaluation scheme to qualitatively and quantitatively assess the quality of the simulated sequences.
|
|
WeAT7 |
Dochart-1 |
Theme 05. Cardiac Disease |
Oral Session |
Chair: Heldt, Thomas | Massachusetts Institute of Technology |
Co-Chair: Mukkamala, Ramakrishna | University of Pittsburgh |
|
08:30-08:45, Paper WeAT7.1 | |
Ventricular and Atrial Ejection Fractions Are Associated with Mean Compartmental Cavity Volume in Cardiac Disease |
|
Kerkhof, Peter LM | VU University Medical Center |
Heyndrickx, Guy R. | Cardiovascular Center, OLV Clinic, Aalst |
Diaz-Navarro, Rienzi | Universidad De Valparaiso |
Antohi, Elena-Laura | Emergency Institute for Cardiovascular Diseases, University For |
Serban, Mihaileanu | Institut Mutualiste Montsouris, Paris |
Handly, Neal | Dept. Emergency Medicine, Drexel UniversityCollege of Medicine, |
Keywords: Cardiac mechanics, structure & function - Ventricular mechanics, Cardiac mechanics, structure & function - Cardiac structure from imaging, Cardiac mechanics, structure & function - Heart failure
Abstract: Ejection fraction (EF) is considered to provide clinically useful information. Despite its enormous popularity, with more than 75,000 citations in PubMed, only few studies have traced the origin(s) of its foundation. This fact is surprising, as there are perhaps more papers published that criticize EF, than the number of publications that actually provide a solid (mathematical) basis for its alleged applicability. EF depends on two volume determinations, namely end-systolic volume (ESV) and end-diastolic volume (EDV). EF is defined as 1-ESV/EDV, yielding a metric without physical units. Previously we formulated a robust analytical expression for the nonlinear connection between EF and ESV. Here we extend that approach by providing a formula to illustrate that EF is strongly associated with half the sum (HS) of ESV and EDV. HS is not new, but forms a major component in the recently introduced Global Function Index. For 420 heart failure (HF) patients we found for left ventricular angio data: R(ESV, EDV) = 0.92, R(EF, ESV) = -0.90, and R(EF, HS) = -0.65. For echo (33 HF patients stages A, B, C and D): R(EF, HS) = -0.82. For the right atrium (CMRI in 21 acute myocardial infarction patients): R(EF, HS)=-0.65. For the left atrium (N=86) R (EF, HS)=-0.46. ESV indicates the level to which the ventricle is able to squeeze blood out of the cavity via pressure build-up. In contrast, EF refers to relative volume changes, not to the mechanism of pumping action. We conclude that for each cardiac compartment EF borrows its acclaimed attractiveness from the fact that for a wide patient spectrum the ESV and EDV correlate in a fairly linear manner. Attractiveness of EF features a straightforward mathematical derivation, rather than reflecting underlying physiology.
|
|
08:45-09:00, Paper WeAT7.2 | |
System Design for Optimizing Drug Infusions Using Cardiovascular Space Mapping for Acute Heart Failure |
|
Kataoka, Yasuyuki | NTT Research, Inc |
Fukuda, Yukiko | NTT Research, Inc |
Peterson, Jon | NTT Research, Inc |
Shelly, Iris | NTT Research, Inc |
Alexander, Joe | NTT Research, Inc |
Sunagawa, Kenji | Circulatory System Research Foundation |
Keywords: Cardiovascular and respiratory system modeling - Cardiovascular control models, Cardiac mechanics, structure & function - Heart failure, Cardiovascular, respiratory, and sleep devices - Therapeutics
Abstract: Acute heart failure is caused by various factors and requires multiple drug therapies to remedy underlying causes. Due to the complexity of pharmacologic effects of cardiovascular agents, few studies have theoretically addressed the multidrug optimization problem. This paper proposes a drug infusion system for acute heart failure that controls cardiovascular performance metrics (cardiac output, left atrial pressure, and mean arterial pressure) within desired ranges as dictated by the cardiovascular parameters (systemic vascular resistance, cardiac contractility, heart rate, and stressed blood volume). The key to our system design is modeling and controlling cardiovascular parameters to yield the desired cardiovascular metrics. A `tailored drug infusion' technique controls parameters by solving the optimization problem in order to conquer the complexity of multi-dependencies and the different dosage limits among multiple drugs. A `cardiovascular space mapping' technique identifies the desired parameters from the desired metrics by deriving the analytical solutions of the metrics as functions of the parameters. To facilitate clinical discussions, parameters were set to realistic values in 5,600 simulated patients. Our results showed not only that the optimized drug combinations and dosages controlled the cardiovascular metrics to within the desired ranges, but also that they mostly corresponded to the recommended clinical use guidelines. An additional value of our system is that it proactively predicts the limitations of the tailored drug therapy, which supports the clinical decision of pivoting to alternative treatment strategies such as mechanical circulatory support.
|
|
09:00-09:15, Paper WeAT7.3 | |
Recurrence Plot-Based Classification of Ischemic and Dilated Cardiomyopathy Patients |
|
Rodriguez, Javier | Institut De Bioenginyeria De Catalunya (IBEC) |
Schulz, Steffen | Charité Universitätsmedizin Berlin |
Voss, Andreas | Technical University Ilmenau |
Giraldo, Beatriz | Institute for Bioengineering of Catalonia (IBEC) |
Keywords: Cardiovascular and respiratory signal processing - Pulse transit time, Cardiovascular and respiratory signal processing - Lung Sounds
Abstract: A large portion of the elderly population are affected by cardiovascular diseases. The early prognosis of cardiomyopathies is still a challenge. The aim of this study was to classify cardiomyopathy patients by their etiology in function of significant indexes extracted from the characterization of the recurrence plot of the systems involved. Thirty-nine cardiomyopathy patients (CMP) classified as ischemic (ICM – 24 patients) and dilated (DCM – 15 patients) were considered. In addition, thirty-nine control subjects (CON) were used as reference. The beat-to-beat (BBI) time series, from the electrocardiographic signal, the systolic (SBP), and diastolic (DBP) time series, from the blood pressure signal, and the respiratory time (FLW) from the respiratory flow signal, were extracted. The recurrence plot from each signal considered were calculated and characterized by a total of 12 indexes. The best classifiers were used to build support vector machine models. The optimal model to classify ICM versus DCM patients achieved 92.3% accuracy, 95.8% sensitivity, and 86.6% specificity. When comparing CMP patients and CON subjects, the best model achieved 85.8% accuracy, 92.3% sensitivity, and 80.1% specificity. Our results suggest a more deterministic behavior in DCM patients. Clinical Relevance— This study explores the recurrence plot for the classification of ICM and DCM patients.
|
|
09:15-09:30, Paper WeAT7.4 | |
Various Approaches to Define the Volume Intercept of the Ventricular End-Systolic Pressure-Volume Relationship: Implications for Statistical Analysis |
|
Kerkhof, Peter LM | VU University Medical Center |
Li, John K-J. | Rutgers University |
Handly, Neal | Dept. Emergency Medicine, Drexel UniversityCollege of Medicine, |
|
|
09:30-09:45, Paper WeAT7.5 | |
Characterization of Physiologic Patients' Response to Fluid Interventions in the Intensive Care Unit |
|
Mollura, Maximiliano | Politecnico Di Milano |
Salerni, Claudia | Politecnico Di Milano |
Lehman, Li-wei | Massachusetts Institute of Technology |
Barbieri, Riccardo | Politecnico Di Milano |
Keywords: Cardiovascular regulation - Heart rate variability, Cardiovascular regulation - Baroreflex, Pulmonary and critical care - Bioengineering applications in Intensive care
Abstract: Fluid administration is one of the most common therapies performed on intensive care patients. However, no clinical evidence is available to establish optimal strategies for fluid management as well as characterizing the effects on the cardiovascular system after therapy initiation. Moreover, fluid overload showed a correlation with worse clinical outcomes. This study aims at characterizing the response to the fluid intervention of intensive care unit patients. We extracted a population of 57 subjects with available electrocardiogram and arterial blood pressure recordings from the MIMIC-III database and evaluated the induced changes in cardiovascular and autonomic indices. We compare autonomic indices obtained from a statistical model of heartbeat dynamics before and after the intervention. Results show significant differences in RR interval, blood pressure, autonomic and Baroreflex activities up to 60 minutes after fluid administration. Specifically, we observed a median increase in RR interval, Baroreflex activity, and overall activity both in pressure and RR time series, as well as a reduction in systolic blood pressure. Specifically, a subgroup of survived patients shows an imbalance toward sympathetic activity, whereas non-survivors have a persistent vagal state after fluid administration. Clinical relevance - The observed differences in autonomic response after fluid administration, together with the assessment of their correlation with patients' mortality, paves the way for the inclusion of heart rate variability indices as markers for assessing fluid responsiveness as associated with ICU patients' state.
|
|
09:45-10:00, Paper WeAT7.6 | |
Predicting Hypertensive Events with Time-Series Analysis of Mean Arterial Pressure |
|
Connor, J Patrick | Massachusetts General Hospital |
Pepino, Jeremy Alanano | Massachusetts General Hospital |
Kwon, Brian | University of British Columbia |
Horiguchi, Daisuke | Nihon Kohden Corporation |
Hahn, Jin-Oh | University of Maryland |
Reisner, Andrew | Massachusetts General Hospital |
Keywords: Pulmonary and critical care - Bioengineering applications in Intensive care, Vascular mechanics and hemodynamics - Arterial pressure in cardiovascular disease, Cardiovascular and respiratory signal processing - Cardiovascular signal processing
Abstract: We investigated whether a statistical model used previously to predict hypotension from mean arterial pressure (MAP) time series analysis could predict hypertension. We performed a retrospective analysis of minute-by-minute MAP records from two cohorts of intensive care unit (ICU) patients. The first cohort was comprised of surgical and medical ICUs while the second cohort was comprised of acute spinal cord injury (ASCI) patients in a neurological ICU. At each time point with physiological MAP, time series analysis was used to predict the median MAP for the subsequent 20 min. This method was used to predict hypertensive episodes, i.e., intervals of 20 or more minutes where at least half of the MAP measurements were > 105 mmHg.Advance prediction of hypertensive episodes was similar in the two cohorts (69.15% vs. 82.61%, respectively), as was positive predictive value of the hypertension predictions (67.42% vs. 71.57%). The results suggest that the methodology may be usable for predicting hypertension from time-series analysis of MAP.
|
|
WeAT8 |
Dochart-2 |
Theme 09. Clinical Engineering I |
Oral Session |
Chair: Pino, Esteban J | Universidad De Concepcion |
Co-Chair: Panescu, Dorin | Biotronik |
|
08:30-08:45, Paper WeAT8.1 | |
Measurement of Peritoneal Fluid Absorption and Ultrafiltration During Peritoneal Dialysis Using Segmental Bioimpedance |
|
Zhu, Fansan | Renal Research Institute |
Laura Rosales M., Laura | Renal Research Institute |
Tisdale, Lela | Renal Research Institute |
Villarama, Maricar | Renal Research Institute |
Kotanko, Peter | Renal Research Institute |
Keywords: Artificial organs (including heart, kidney, liver, pancreas, retina), Diagnostic devices - Physiological monitoring
Abstract: The aim of this study was to measure intraperitoneal volume (IPV) and ultrafiltration volume (UFV) by monitoring the abdominal resistance using segmental bioimpedance analysis (SBIA, Hydra 4200). Twenty peritoneal dialysis (PD) patients were studied during a fill with 2 L of 2.5% glucose peritoneal dialysate solution. UFVDrain (g) was measured as weight difference between fill and drain dialysate volumes. Ultrafiltration volume (UFVSBIA; ml) and absorption volume were calculated from the IPV curve derived by SBIA. UFVSBIA correlated with UFVDrain (R2=0.98, p<0.0001). This study may provide actionable clinical insights and help clinicians to better understand the function of the peritoneal membrane in individuals. Clinical Relevance— This technique may support personalized medicine by aiding the prescription of PD therapy on a patient-level.
|
|
08:45-09:00, Paper WeAT8.2 | |
Multi-Frequency Electrical Impedance Pneumography System As Point-Of-Care Device |
|
Pino, Esteban J | Universidad De Concepcion |
Alvarado, Fabian | Universidad De Concepcion |
Keywords: Physiological monitoring & diagnistic devices - Pulmonary disease, Wearable or portable devices for vital signal monitoring, Ambulatory Diagnostic devices - Point of care technologies
Abstract: In this work we present the development of a multi-frequency electrical impedance pneumography (EIP) system based on a portable acquisition device and a mobile platform. This design is intended as an upgrade to our previous device for clinical use in the screening of patients with pulmonary diseases. The acquisition device uses the bioimpedance analog front end MAX30001, a mux/demux stage, Bluetooth 4.0 communication and an ESP32 microcontroller unit. It generates an excitation current of 8 uApp in a range of selectable frequencies from 1 kHz to 130 kHz. The mobile platform provides a real-time respiration signal at 64 S/s and allows the configuration of the device. Results show better accuracy in higher frequencies, which are the most common in these applications, ensuring the feasibility of the system for use in humans. Some hardware constraints related to EIP integrated circuits are discussed, an their effect in the signal acquisition.
|
|
09:00-09:15, Paper WeAT8.3 | |
Electrocution Risk of Capacitive Discharge Shocks: Application to Electric Vehicle Charging |
|
Kroll, Mark William | University of Minnesota |
Panescu, Dorin | Biotronik |
Perkins, Pete | Safety Engineering |
Koch, Michael | Eaton GmbH |
Andrews, Chris | University of Queensland |
Keywords: Clinical engineering - Device safety and efficacy evaluation (electrical safety, electromagnetic compatibility and immunity)
Abstract: It is difficult to electrocute (induce ventricular fibrillation) with capacitive discharge shocks. With small capacitance values, the high voltages required for the necessary charge are rarely seen in industrial situations (e.g. electric vehicle charging stations). On the other hand, with large capacitance values, the discharge time is so great that the shock couples inefficiently with the cardiac cells. The update to IEC 60479-2 sets the C1 “mostly-safe” charge limit of 3 mC for a short “impulse function” pulse. We calculated the equivalent capacitor stored charge for an arbitrary capacitance value using the simple single membrane time constant model for the cardiac response. The peak membrane response was set equal to that of the 3 mC impulse function response to calculate the safe values for stored charge, voltage, and energy. The total stored charge, per se, cannot be used simplistically to estimate the danger of a capacitive discharge shock. A capacitive-discharge shock cannot be accurately compared to a rectangular shock with a duration equal to the shock time constant. The greater the capacitance, the larger the fraction of wasted charge in coupling to the heart and thus the shorter equivalent duration compared to the shock time constant. For a capacitive discharge shock this translates to a stored charge of 3 mC increasing up to 9 mC for a 10 µF capacitor using the assumed 575 Ω load for an electric-vehicle (EV) charging station. In the area of interest for 1 - 10 µF, the safe voltage ranges from 1300 to 4700 V, which includes the 1500-VDC scope of EV charger standard IEC 61851-23. For C > 100 µF, the voltage asymptote is 700 V.
|
|
09:15-09:30, Paper WeAT8.4 | |
Design of a 6-DoF Cost-Effective Differential-Drive Based Robotic System for Upper-Limb Stroke Rehabilitation |
|
Jonna, Prashanth | IIIT-Bangalore |
Rao, Madhav | IIITBangalore |
Keywords: Robotic-aided therapies - Targeted therapy systems, Neuromuscular systems - Stroke therapy devices / technologies, Ambulatory Therapeutic Devices - Personalized therapeutic devices and emergency response systems
Abstract: This paper discusses the design, construction, and characteristics of a six degree of freedom (6-DoF) robotic upper limb stroke rehabilitation device. The device is primarily designed to be used by stroke survivors suffering from hemiparesis, post stroke therapy. The proposed device is aimed to substitute upper limb exoskeletons or end-effector robotic arm based rehabilitation systems, which are generally bulky, expensive due to customized design, and are used only in clinical settings. The device is capable of aiding patients perform rehabilitation exercises involving abduction, adduction, flexion, and extension movements for the wrist, shoulder joints and flexion, extension, pronation, supination for the elbow joint. The device has a mobile base and uses differential drive principles for movement. The device provides an end-effector plate having force-sensitive resistors (FSRs) on which the patient can rest their hand. The mobile base of the rehabilitation system enables it to aid for a greater range of movements when compared to other end-effector-based upper limb rehabilitation devices. Additionally, a camera is mounted on top of the 6-DoF robotic system to enable finger tracking from a remote system using the MediaPipe framework, and measure the hand instability metric over time to assess patient's performance. Clinical relevance — The 6-DoF rehabilitation system is capable of aiding different range of motions for upper limb. The low-cost generic system is applicable for different physical personalities, enabling quick adoption for large patient populations who are in need of rehabilitation systems. The 6-DoF system is aimed towards clinical, and domestic usage by patients.
|
|
09:30-09:45, Paper WeAT8.5 | |
Cystatin C As a Biomarker for Cardiorenal Syndrome Diseases Quantitative Diagnostics and Monitoring Via Point-Of-Care |
|
Zhang, Xushuo | Ulster University |
Fishlock, Sam | Ulster University |
Sharpe, Peter | Craigavon Area Hospital |
McLaughlin, James | University of Ulster |
Keywords: Ambulatory Diagnostic devices - Point of care technologies, Clinical engineering -Verification and validation of diagnostic & therapeutic systems / technologies, Physiological monitoring & diagnistic devices - Heart failure
Abstract: Abstract— With heart failure (HF) and renal malfunction becoming global public health issues, there is an urgent need to monitor diseases at home or in the community. Point-of-care testing (POC) would shorten the patients waiting time compared with laboratory molecular analysis. This work evaluates two types of gold nanomaterials, and two assay protocols, to develop a lateral flow (LF) system for Cystatin C (CysC) quantification. Of the protocols, the ‘delayed-release’ shows the alleviation of the hook effects with 1% BSA running buffer (RB), albeit at increased complexity with three steps of washing. The standard method with sample dilution (1: 150 sample dilution for GNPs, and 1:10 sample dilution for GNRs) can ensure the clinical range detection of CysC as 1 mg/L with partial LF assays. GNPs have stronger optical signal intensity compared with GNRs and developed full LF assays with GNPs require 1:1.5 sample dilution in recombinant CysC detection. The ideal sample dilution ratio is different for partial and full LF assays. Clinical Relevance— This work is the basis of future work that will use LF devices for human serum/plasma monitoring to assess kidney function related to heart failure during medication. The specificity, sensitivity, and limit of detection will be validated via a clinical trial before potential clinical use.
|
|
09:45-10:00, Paper WeAT8.6 | |
GlucoseML Mobile Application for Automated Dietary Assessment of Mediterranean Food |
|
Konstantakopoulos, Fotios S. | University of Ioannina |
Georga, Eleni I. | University of Ioannina |
Tzanettis, Kostis | AppArt SA |
Kokkinopoulos, Konstantinos A. | AppArt SA |
Raptis, Stefanos K. | AppArt SA |
Michaloglou, Klearchos A. | AppArt SA |
Fotiadis, Dimitrios I. | University of Ioannina |
Keywords: Computer modeling for treatment planning, Glucose remote monitoring devices and technologies, Diagnostic devices - Physiological monitoring
Abstract: Over the years and with the help of technology, the daily care of type 1 diabetes has been improved significantly. The increased adoption of continuous glucose monitoring, the continuous subcutaneous insulin injection and the accurate behavioral monitoring mHealth solutions have contributed to this phenomenon. In this study we present a mobile application for automated dietary assessment of Mediterranean food images as part of the GlucoseML system. Based on short-term predictive analysis of the glucose trajectory, GlucoseML is a type-1 diabetes self-management system. A computer vision approach is used as main part of the GlucoseML dietary assessment system calculating food carbohydrates, fats and proteins, relying on: (i) a deep learning subsystem for food image classification, and (ii) a 3D food image reconstruction subsystem for the volume estimation of food. The deep learning subsystem achieves 82.4% and 97.5% top-1 and top-5 accuracy, respectively, for food image classification while the subsystem for volume estimation of food achieves a mean absolute percentage error 10.7% for the four main categories of MedGRFood dataset.
|
|
WeAT9 |
Gala |
Theme 02. Image Analysis and Classification - Machine Learning / Deep
Learning Approaches - I |
Oral Session |
Co-Chair: Kupas, David | University of Debrecen |
|
08:30-08:45, Paper WeAT9.1 | |
Multiclass Classification of Prostate Tumors Following an MR Image Analysis-Based Radiomics Approach |
|
Jaén-Lorites, José Manuel | Universitat Politècnica De València |
Ruiz-España, Silvia | Universitat Politècnica De València |
Piñeiro-Vidal, Tania | ASCIRES Biomedical Group |
Santabárbara, José Manuel | ASCIRES Grupo Biomédico |
Maceira, Alicia M. | ASCIRES Grupo Biomédico |
Moratal, David | Universitat Politècnica De València |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image feature extraction, Magnetic resonance imaging - Other organs
Abstract: Prostate cancer is one of the most common cancers in men, with symptoms that may be confused with those caused by benign prostatic hyperplasia. One of the key aspects of treating prostate cancer is its early detection, increasing life expectancy and improving the quality of life of those patients. However, the tests performed are often invasive, resulting in a biopsy. A non-invasive alternative is the magnetic resonance imaging (MRI)-based PI-RADS v2 classification. The aim of this work was to find objective biomarkers that allow the PI-RADS classification of prostate lesions using a radiomics approach on Multiparametric MRI. A total of 90 subjects were analyzed. From each segmented lesion, 609 different texture features were extracted using five different statistical methods. Two feature selection methods and eight multiclass predictive models were evaluated. This was a multiclass study in which the best AUC result was 0.7442 ± 0.0880, achieved with the Naïve Bayes model using a subset of 120 features. Valuable results were also obtained using the Random Forests model, obtaining an AUC of 0.7394 ± 0.0965 with a lower number of features (52).
|
|
08:45-09:00, Paper WeAT9.2 | |
Bilateral Analysis Boosts the Performance of Mammography-Based Deep Learning Models in Breast Cancer Risk Prediction |
|
Mohamed, Alaa | Cairo University |
Fakhry, Sherihan | Baheya Foundation for Early Detection and Treatment of Breast Ca |
Basha, Tamer | Cairo University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification
Abstract: Breast cancer is one of the leading causes of death among women. Early prediction of breast cancer can significantly improve the survival rates. Breast density was proven as a reliable risk factor. Deep learning models can learn subtle cues in the mammogram images. CNN models were recently shown to improve the risk discrimination in full-field mammograms. This study aims to improve risk prediction models using bilateral analysis. Bilateral analysis is the process of comparing two breasts to verify presence of anomalies. We developed a Siamese neural network to leverage the bilateral information and asymmetries between the two mammograms of the same patient. We tested our model on 271 patients and compared the results of our Siamese model against the traditional unilateral CNN model. Our results showed AUCs of 0.75 and 0.70 respectively (p = 0.0056). The Siamese model also exhibits higher sensitivity, specificity, precision, and false positive rate with values of 0.68, 0.69, 0.71, 0.31 respectively. While the CNN values were 0.61, 0.66, 0.67, 0.34 respectively. We merged both models by two techniques using pre-trained weights and weighted voting ensemble. The merging technique boosted the AUC to 0.78. The results suggest that bilateral analysis can significantly improve the risk discrimination.
|
|
09:00-09:15, Paper WeAT9.3 | |
MCascade R-CNN: a Modified Cascade R-CNN for Detection of Calcified on Coronary Artery Angiography Images (withdrawn from program) |
|
Wang, Wei | Beijing University of Posts and Telecommunications, CHINA |
zhang, yi | Beijing University of Posts and Telecommunications |
Wang, Xiaofei | Beijing University of Posts and Telecommunications |
Zhang, Honggang | Beijing University of Posts and Telecommunications |
Xie, Lihua | Fu Wai Hospital, National Center for Cardiovascular Diseases, Ch |
Xu, Bo | Fu Wai Hospital, National Center for Cardiovascular Diseases, Ch |
|
09:15-09:30, Paper WeAT9.4 | |
Data-Efficient Training of Pure Vision Transformers for the Task of Chest X-Ray Abnormality Detection Using Knowledge Distillation |
|
Jalalifar, Ali | York University |
Sadeghi-Naini, Ali | York University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, X-ray imaging applications, Machine learning / Deep learning approaches
Abstract: It is generally believed that vision transformers (ViTs) require a huge amount of data to generalize well, which limits their adoption. The introduction of data-efficient algorithms such as data-efficient image transformers (DeiT) provided an opportunity to explore the application of ViTs in medical imaging, where data scarcity is a limiting factor. In this work, we investigated the possibility of using pure transformers for the task of chest x-ray abnormality detection on a small dataset. Our proposed framework is built on a DeiT structure benefiting from a teacher-student scheme for training, with a DenseNet with strong classification performance as the teacher and an adapted ViT as the student. The results show that the performance of transformers is on par with that of convolutional neural networks (CNNs). We achieved a test accuracy of 92.2% for the task of classifying chest x-ray images (normal/pneumonia/COVID-19) on a carefully selected dataset using pure transformers. The results show the capability of transformers to accompany or replace CNNs for achieving state-of-the-art in medical imaging applications. The code and models of this work are available at https://github.com/Quantimb-Lab/DeiT_Covid.
|
|
09:30-09:45, Paper WeAT9.5 | |
Gauging Facial Abnormality Using Haar-Cascade Object Detector |
|
Takiddin, Abdulrahman | Texas A&M University |
Mohammad Shaqfeh, Mohammad | Texas A&M University at Qatar |
Boyaci, Osman | Texas A&M University |
Serpedin, Erchin | Texas A&M University |
Stotland, Mitchell | Sidra Medicine |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Deformable registration, Functional image analysis
Abstract: The overriding clinical and academic challenge that inspires this work is the lack of a universally accepted, objective, and feasible method of measuring facial deformity; and, by extension, the lack of a reliable means of assessing the benefits and shortcomings of craniofacial surgical interventions. We propose a machine learning-based method to create a scale of facial deformity by producing numerical scores that reflect the level of deformity. An object detector that is constructed using a cascade function of Haar features has been trained with a rich dataset of normal faces in addition to a collection of images that does not contain faces. After that, the confidence score of the face detector was used as a gauge of facial abnormality. The scores were compared with a benchmark that is based on human appraisals obtained using a survey of a range of facial deformities. Interestingly, the overall Pearson's correlation coefficient of the machine scores with respect to the average human score exceeded 0.96.
|
|
09:45-10:00, Paper WeAT9.6 | |
Classification of Pap-Smear Cell Images Using Deep Convolutional Neural Network Accelerated by Hand-Crafted Features |
|
Kupas, David | University of Debrecen |
Harangi, Balazs | University of Debrecen |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image analysis and classification - Digital Pathology, Image feature extraction
Abstract: The classification of cells extracted from Pap-smears is in most cases done using neural network architectures. Nevertheless, the importance of features extracted with digital image processing is also discussed in many related articles. Decision support systems and automated analysis tools of Pap-smears often use these kinds of manually extracted, global features based on clinical expert opinion. In this paper, a solution is introduced where 29 different contextual features are combined with local features learned by a neural network so that it increases classification performance. The weight distribution between the features is also investigated leading to a conclusion that the numerical features are indeed forming an important part of the learning process. Furthermore, extensive testing of the presented methods is done using a dataset annotated by clinical experts. An increase of 3.2% in F1-Score value can be observed when using the combination of contextual and local features.
|
|
WeAT10 |
Forth |
Theme 02. Magnetic Resonance Imaging |
Oral Session |
Chair: Chang, Yuchou | University of Massachusetts Dartmouth |
Co-Chair: Ji, Jim Xiuquan | Texas A&M University |
|
08:30-08:45, Paper WeAT10.1 | |
Interpretable Dimension Reduction for MRI Channel Suppression |
|
Chang, Yuchou | University of Massachusetts Dartmouth |
Zhang, Jiming | University of Vermont Medical Center |
Pham, Huy Anh | 8100 Cambridge St |
Lyu, Jingyuan | UIH America, Inc |
Li, Zhiqiang | Barrow Neurological Institute |
Keywords: Magnetic resonance imaging - Parallel MRI, Image reconstruction - Fast algorithms, Image reconstruction and enhancement - Machine learning / Deep learning approaches
Abstract: Channel suppression can reduce the redundant information in multiple channel receiver coils and accelerate reconstruction speed to meet real-time imaging requirements. The principal component analysis has been used for channel suppression, but it is difficult to be interpreted because all channels contribute to principal components. Furthermore, the importance of interpretability in machine learning has recently attracted increasing attention in radiology. To improve the interpretability of PCA-based channel suppression, a sparse PCA method is proposed to reduce the most coils’ loadings to be zero. Channel suppression is formulated as solving a nonlinear eigenvalue problem using the inverse power method instead of the direct matrix decomposition. Experimental results of in vivo data show that the sparse PCA-based channel suppression not only improves the interpretability with sparse channels, but also improves reconstruction quality compared to the standard PCA-based reconstruction with the similar reconstruction time.
|
|
08:45-09:00, Paper WeAT10.2 | |
Rapid MR Scanner Independent B1 Field Measurement System for Phased Arrays |
|
Raja Viswanath, Madavan | Texas A&M University |
Wright, Steven M. | Texas A&M University |
Keywords: Magnetic resonance imaging - Parallel MRI, Magnetic resonance imaging - MRI RF coil technology
Abstract: This paper demonstrates a rapid B1 field benchtop measurement system that is independent of an MR scanner and network analyzer. This system can be used to obtain radiofrequency (B1 field) strength distribution plots of multiple 2D slices (with an extension to 3D) of a liquid cylindrical phantom for multi-element phased arrays used in MRI. The system can be used in three modes- element, phased array, and multiple fixed point pattern measurement. These modes are demonstrated for a 7T 1H eight-channel dipole array and a corn-syrup based phantom. The system can measure complex phase and amplitude measurements from up to 8 elements in the first mode one or 8 different phase settings in the second mode at a rate of approximately 37 positions per minute, allowing a full 2D B1 mapping for 1303 points in 33.05 minutes. The scan patterns obtained using this setup are compared to the ones obtained using an HP network analyzer and simulations. This work can be extended to measure the E field, SAR and upon increasing the speed of measurement, could be used for applications such as Transmit SENSE.
|
|
09:00-09:15, Paper WeAT10.3 | |
Learning-Based Method for K-Space Trajectory Design in MRI |
|
Sharma, Shubham | IISc |
K.V.S., Hari | IISc |
Leus, Geert | TU Delft |
Keywords: Magnetic resonance imaging - Pulse sequence, Image reconstruction and enhancement - Compressed sensing / Sampling
Abstract: Variable density sampling of the k-space in MRI is an integral part of trajectory design. It has been observed that data-driven trajectory design methods provide a better image reconstruction as compared to trajectories obtained from a fixed or a parametric density function. In this paper, a data-driven strategy has been proposed to obtain non-Cartesian continuous k-space sampling trajectories for MRI under the compressed sensing framework (greedy non-Cartesian (GNC)). A stochastic version of the algorithm (stochastic greedy non-Cartesian (SGNC)) is also proposed that reduces the computation time. We compare the proposed trajectory with a traveling salesman problem (TSP)-based trajectory and an echo planar imaging-like trajectory obtained by a greedy method called stochastic greedy-Cartesian (SGC) algorithm. The training images are taken from knee images of the fastMRI dataset. It is observed that the proposed algorithms outperform the TSP-based and the SGC trajectories for similar read-out times.
|
|
09:15-09:30, Paper WeAT10.4 | |
Improving Patient Comfort in MRI with Predictive Acoustic Noise Cancelling |
|
Šiurytė, Paulina | Delft University of Technology |
Tourais, Joao | Delft University of Technology |
Weingärtner, Sebastian | Delft University of Technology |
Keywords: Magnetic resonance imaging - Pulse sequence
Abstract: With sound pressure levels reaching up to 130 dB, acoustic noise in Magnetic Resonance Imaging (MRI) is one of the main sources of patient discomfort in otherwise one of the safest medical imaging modalities. In this work, a noise prediction-based approach, termed predictive noise cancelling (PNC), is applied, for the first time, to suppress noise in MRI. In PNC the noise from the scanner gradient coils is predicted based on linear time-invariant models, which relate the individual gradient coil (X, Y and Z) input to the acoustic noise output. A model setup was constructed of a custom speaker box and MRI-compatible microphone to demonstrate live noise reduction. Additional tuning steps, including output channel equalization and clock mismatch correction, were performed to maximize noise reduction. A calibration sequence was designed to determine the model and tuning parameters. Analysis of actual scanner noise shows an upper limit of 21 dB noise reduction with the proposed linear model. For the components of a clinical example sequence, the setup demonstrated in-bore live noise reduction of up to 10 dB (7.01 ± 0.31 dB, 6.42 ± 2.04 dB and 9.28 ± 0.26 dB for X, Y and Z gradient coils respectively) in the presence of system imperfections. The results indicate promising noise attenuation without the need to modify scanner hardware or compromises in acquisition speed or quality. This has potential to substantially and cost effectively improve patient comfort in clinical MRI.
|
|
09:30-09:45, Paper WeAT10.5 | |
Signal-Intensity Informed Multi-Coil MRI Encoding Operator for Improved Physics-Guided Deep Learning Reconstruction of Dynamic Contrast-Enhanced MRI |
|
Demirel, Omer Burak | University of Minnesota |
Yaman, Burhaneddin | University of Minnesota |
Moeller, Steen | University of Minnesota |
Weingärtner, Sebastian | Delft University of Technology |
Akcakaya, Mehmet | University of Minnesota |
Keywords: Magnetic resonance imaging - Dynamic contrast-enhanced MRI, Image reconstruction and enhancement - Machine learning / Deep learning approaches, Magnetic resonance imaging - Parallel MRI
Abstract: Dynamic contrast enhanced (DCE) MRI acquires a series of images following the administration of a contrast agent, and plays an important clinical role in diagnosing various diseases. DCE MRI typically necessitates rapid imaging to provide sufficient spatio-temporal resolution and coverage. Conventional MRI acceleration techniques exhibit limited image quality at such high acceleration rates. Recently, deep learning (DL) methods have gained interest for improving highly-accelerated MRI. However, DCE MRI series show substantial variations in SNR and contrast across images. This hinders the quality and generalizability of DL methods, when applied across time frames. In this study, we propose signal intensity informed multi-coil MRI encoding operator for improved DL reconstruction of DCE MRI. The output of the corresponding inverse problem for this forward operator leads to more uniform contrast across time frames, since the proposed operator captures signal intensity variations across time frames while not altering the coil sensitivities. Our results in perfusion cardiac MRI show that high-quality images are reconstructed at very high acceleration rates, with substantial improvement over existing methods.
|
|
09:45-10:00, Paper WeAT10.6 | |
Mind the Gap: Functional Network Connectivity Interpolation between Schizophrenia Patients and Controls Using a Variational Autoencoder |
|
Li, Xinhui | Georgia Institute of Technology |
Geenjaar, Eloy Philip Theo | Tri-Institutional Center for Translational Research in Neuroimag |
Fu, Zening | Georgia State University |
Plis, Sergey M. | Tri-Institutional Center for Translational Research in Neuroimag |
Calhoun, Vince D | Tri-Institutional Center for Translational Research in Neuroimag |
Keywords: Image reconstruction and enhancement - Machine learning / Deep learning approaches, Brain imaging and image analysis, Magnetic resonance imaging - MR neuroimaging
Abstract: Mental disorders such as schizophrenia have been challenging to characterize due in part to their heterogeneous presentation in individuals. Most studies have focused on identifying groups differences and have typically ignored the heterogeneous patterns within groups. Here we propose a novel approach based on a variational autoencoder (VAE) to interpolate static functional network connectivity (sFNC) across individuals, with group-specific patterns between schizophrenia patients and controls captured simultaneously. We then visualize the original sFNC in a 2D grid according to the samples in the VAE latent space. We observe a high correspondence between the generated and the original sFNC. The proposed framework facilitates data visualization and can potentially be applied to predict the stage that a subject falls within a disorder continuum as well as characterize individual heterogeneity within and between groups.
|
|
WeAT12 |
M1 |
Theme 06. Biomechanics for Rehabiitation |
Oral Session |
Chair: Patton, James | U. Illinois at Chicago (UIC), & the Shirley Ryan Ability Lab (formerly RIC) |
|
08:30-08:45, Paper WeAT12.1 | |
Stability of Inverted Pendulum Reveals Transition between Predictive Control and Impedance Control in Grip Force Modulation |
|
Leib, Raz | Technical University of Munich |
Franklin, Sae | Institute for Cognitive Systems, Technical University of Munich |
Cesonis, Justinas | Technical University of Munich |
Franklin, David W. | Technical University of Munich |
Keywords: Motor learning, neural control, and neuromuscular systems, Neuromuscular systems - Postural and balance, Neuromuscular systems - Learning and adaption
Abstract: During object manipulation, our sensorimotor system needs to represent the object’s dynamics in order to better control it. This is especially important in the case of grip force control where small forces can cause the object to slip from our fingers, and excessive forces can cause fatigue or even damage the object. While the tradeoff between these two constraints is clear for stable objects, such as lifting a soda can, it is less clear how the sensorimotor system adjusts the grip force for unstable objects. For this purpose, we measured the change in the grip force of individual human participants while they stabilize five different lengths of an inverted pendulum. These lengths set different dynamics of the pendulum, ranging in their degree of controllability. We observed two main states during such manipulation, a marginally stable state of the pendulum and a stabilization state in which participants acted to stabilize the system. While during the stabilization state participants increased their applied grip force, for the stable state we observed a mixed behaviour. For small and less controllable pendulums, grip force increased while for larger pendulums, participants could modulate the the grip force according to the anticipated load forces. Based on these results, we suggest that the pendulum dynamics change the control strategy between predictive control and impedance control.
|
|
08:45-09:00, Paper WeAT12.2 | |
Electro-Prosthetic E-Skin Successfully Delivers Elbow Joint Angle Information by Electro-Prosthetic Proprioception (EPP) |
|
Oh, Semyoung | Texas A&M University |
Patton, James | U. Illinois at Chicago (UIC), & the Shirley Ryan Ability Lab (fo |
Park, Hangue | Texas A&M University |
Keywords: Sensory neuroprostheses - Somatosensory, Neural interfaces - Body interfaces, Human performance - Sensory-motor
Abstract: Neurotraumas and neurological diseases often result in compromised proprioceptive feedback, which plays a critical role in motor control by delivering real-time position information. Frequency-modulated electrotactile feedback is a promising solution, as it can deliver proprioceptive information such as a joint angle. Prior works demonstrated that frequency-modulated electrotactile feedback successfully delivered distance information between the end effector and the target object. In this study, we implemented the electronic skin (E-skin) monitoring the elbow joint angle and delivering it to the nervous system via tactile channel. We also demonstrated that frequency-modulated electrotactile feedback improved the elbow joint angle control. The gyroscope measuring the elbow joint angle and electrodes delivering electrotactile feedback were integrated together as a skin using thin silicon coating and polyurethane film. We call this novel E-skin, monitoring and delivering joint angle information, as an electro-prosthetic E-skin. Elbow joint angle matching test with two healthy human subjects showed that the frequency-modulated electrotactile feedback, via electro-prosthetic E-skin, enhanced 101.7% accuracy and 63.8% precision in elbow joint angle control.
|
|
09:00-09:15, Paper WeAT12.3 | |
Static Balance Characterization Using a Single IMU Located in the Lower Back: Preliminary Results |
|
Pinto, Daniela | University of Concepcion |
Pastene, Francisco | Universidad De Concepcion |
Godoy, Julio | Universidad De Concepcion |
Gomez, Britam | University of Concepcion |
Ortega, Paulina | Universidad De Concepción |
Aqueveque, Pablo | Universidad De Concepcion |
Keywords: Neuromuscular systems - Postural and balance
Abstract: Balance refers to the dynamics of body posture to prevent falls. For years, researchers have tried to find out which tasks and measures provide optimal detection of balance disorders, so that they can be quantified. This paper proposes the use of an accelerometer sensor located in the lower back to measure the center of mass accelerations and to characterize the subject’s static balance. For characterizing the static balance objectively, we propose using normality circles, a centroid, and a dispersion circle during the modified Clinical Test of Sensory Interaction in Balance (mCTSIB) test. The proposed methodology was tested using two groups of subjects (10 healthy and 3 unhealthy). Our methodology for the static balance was compared to the Berg Balance Scale score. The results shown that a subject with lower BBS score obtain lower dispersion circle and is outside the normality circle. Also, our method outperforms a new option since it characterizes the static balance in an objective, portable, simple, and low-cost way.
|
|
09:15-09:30, Paper WeAT12.4 | |
Human Performance of Three Hands in Unimanual, Bimanual and Trimanual Tasks |
|
huang, yanpei | Imperial College London |
Eden, Jonathan | Imperial College London |
Ivanova, Ekaterina | Imperial College London |
Burdet, Etienne | Imperial Collge of Science, Technology and Medicine |
Keywords: Human performance, Human performance - Activities of daily living, Human performance - Ergonomics and human factors
Abstract: Trimanual operation using a robotic supernumerary limb is a new and challenging mechanism for human operators that could enable a single user to perform tasks requiring more than two hands. Foot-controlled interfaces have previously proven able to be intuitively controlled, enabling simple tasks to be performed. However, the effect of going from unimanual to bimanual and then to trimanual tasks on subjects performance and coordination is not well understood. In this paper, unimanual, bimanual and trimanual teleoperation tasks were performed in a virtual reality scene to evaluate the impact of extending to trimanual actions. 15 participants were required to move their limbs together in a coordinated reaching activity. The results show that the addition of another hand resulted in an increase in operating time, where the time increased in going from unimanual to bimanual operation and then increased further when going from bimanual to trimanual. Moreover, the success rate for performing bimanual and trimanual tasks was strongly influenced by the subject's performance in ipsilateral hand-foot activities, where the ipsilateral combination had a lower success rate than contralateral limbs. The addition of a hand did not affect any two-hand coordination rate and even in some cases reduced coordination deviations.
|
|
09:30-09:45, Paper WeAT12.5 | |
Overcoming Facial Paralysis with an Implantable Actuator for Restoration of Blink |
|
Cleary, Jacinta Dawn | University of Sydney |
Kékesi, Orsolya | University of Sydney |
Hasmat, Shaheen | University of New South Wales |
Low, Tsu-Hui Hubert | University of Sydney |
Lovell, Nigel H. | University of New South Wales |
Clark, Jonathan | University of Sydney |
Suaning, Gregg | The University of Sydney |
Keywords: Neural interfaces - Implantable systems, Neural interfaces - Biomaterials, Neurorehabilitation
Abstract: The loss of the ability to blink the eyelid is considered the most severe effect of facial nerve paralysis. The delicate homeostasis of the eye is disrupted, and without frequent intervention, the cornea can become damaged, ultimately resulting in blindness. The psychosocial impact is also significant, with individuals withdrawing from society to hide what they perceive to be a disfigurement. Surgical and engineering interventions have been devised to reanimate blink, however, a solution has yet to be designed which addresses both functional and aesthetic concerns. Here we describe an implantable electromagnetic actuator to restore the capacity to blink. Triggered synchronously with the contralateral eye, and externally modifiable to tailor treatment post-operatively to the individual, this implant restores complete blinking and a natural appearance. Cadaver studies (N=12) have been used to validate the device design, including the form factor and force required to elicit a blink, while a passive in vivo study (N=1) has verified the surgical protocol and recovery.
|
|
09:45-10:00, Paper WeAT12.6 | |
Fall Prediction in People with Parkinson’s Disease |
|
Cao, Phuong | University of St. Thomas |
Min, Cheol-Hong | University of St. Thomas |
Keywords: Human performance, Human performance - Gait, Human performance - Activities of daily living
Abstract: A process of predicting fall events in patients with Parkinson’s disease (PD) by using a simple motion sensor is described in this paper. Causes of falls in people with PD can be postural instability, freezing of gait, festinating gait, dyskinesias, visuospatial dysfunction, orthostatic hypotension, and posture problems. This study uses only one motion sensor in collecting data. Thus, only fall events caused by festinating gait factors which are moments when the patient suddenly moves faster with smaller steps can be performed and tested. In this preliminary study, fall event scenarios of simulated test cases are performed by five healthy young subjects who are 20 to 28 years old. The acceleration mode in the motion sensor provides information that can detect how fast the subjects move. Data that is collected by the sensor will be analyzed by simple analysis methods and machine learning techniques classification. The proposed study achieved an accuracy of 70.3% for the 10-class model while for binary classification it was 99%.
|
|
WeAT13 |
Hall 1 |
Theme 02. Ultrasound Imaging |
Oral Session |
Chair: Sudhakara Murthy, Prasad | GE Healthcare |
|
08:30-08:45, Paper WeAT13.1 | |
Focal U-Net: A Focal Self-Attention Based U-Net for Breast Lesion Segmentation in Ultrasound Images |
|
Zhao, Haochen | Beihang University |
Niu, Jianwei | Beihang University |
Meng, Hui | Hangzhou Innovation Institute, Beihang University |
Wang, Yong | Chinese Academy of Medical Sciences and Peking Union Medical Col |
Li, Qingfeng | Hangzhou Innovation Institute, Beihang University |
Yu, Ziniu | Beihang University |
Keywords: Ultrasound imaging - Breast
Abstract: Accurate breast lesion segmentation in ultrasound images helps radiologists to make exact diagnoses and treatments, which is important to increase the survival rate of breast cancer patients. Recently, deep learning-based methods have demonstrated remarkable results in breast lesion segmentation. However, the blurry breast lesion boundaries and noise artifacts in ultrasound images still limit the performance of the deep learning-based methods. In this paper, we propose a novel segmentation network equipped with a focal self-attention block for improving the performance of breast lesion segmentation. The focal self-attention block can incorporate finegrained local and coarse-grained global information. The finegrained local information is useful to enhance features of breast lesion boundaries, while the coarse-grained global information effectively reduces noise interference. To verify the performance of our network, we implement breast lesion segmentation on our collected dataset of 9836 ultrasound images. The results demonstrate that the focal self-attention block enhances features of breast lesion boundaries and improves the accuracy of breast lesion segmentation.
|
|
08:45-09:00, Paper WeAT13.2 | |
Mechanical Validation of Viscoelastic Parameters for Different Interface Pressures Using the Kelvin-Voigt Fractional Derivative Model |
|
Tecse, Aldo | Pontificia Universidad Católica Del Perú |
Romero, Stefano | Pontificia Universidad Católica Del Perú |
Romero, Carlos Jesus | Pontificia Universidad Católica Del Perú |
Naemi, Roozbeh | Staffordshire University |
Castañeda, Benjamín | Pontificia Universidad Católica Del Perú |
Keywords: Ultrasound imaging - Elastography, Image reconstruction - Performance evaluation
Abstract: The knowledge of the biomechanical properties of tissues is useful for different applications such as disease diagnosis and treatment monitoring. Reverberant Shear Wave Elastography (RSWE) is an approach that has reduced the restrictions on wave generation to characterize the shear wave velocity over a range of frequencies. This approach is based on the generation of a reverberant field that is generated by the reflections of waves from inhomogeneities and tissue boundaries that exist in the tissue. The Kelvin-Voigt Fractional Derivative model is commonly used to characterize elasticity and viscosity of soft tissue when using shear wave ultrasound elatography. These viscoelastic characteristics can be then validated using mechanical measurements (MM) such as stress relaxation. During RSWE acquisition, the effect of interface pressure, induced by pushing the probe on the skin through the gel pad, on the viscous and elastic characteristics of tissue can be investigated. However, the effect of interface pressure on the validity of the extracted viscous and elastic characteristics was not investigated before. Therefore, the purpose of this study was to compare the estimation of the viscoelastic parameters at different thickness of gel pad against the viscoelastic characteristics obtained from MM. The experiments were conducted in a tissue-mimicking phantom. The results confirm that the relaxed elastic constant can be depreciated. In addition, a higher congruence was found in the viscous parameter estimated at 6 and 7 mm. On the other hand, a difference in the order of fractional derivative was found.
|
|
09:00-09:15, Paper WeAT13.3 | |
From 2D Ultrasound to Patient-Specific 3D Surface Models for Interventional Guidance |
|
Annangi, Pavan Kumar | GE Healthcare |
Sudhakar, Prasad | GE Healthcare |
Michael, Washburn | Chief Engineer |
Keywords: Ultrasound imaging - Interventional, Image reconstruction and enhancement - Machine learning / Deep learning approaches, Machine learning / Deep learning approaches
Abstract: Ultrasound exam output of large organs like liver has traditionally been limited to still images or 2D cine loops of key structures, without 3D context. Having 3D context for follow up studies makes ultrasound scanning much easier and for interventional applications such as biopsy. 3D context will reduce wrong sample selection thereby increasing patient comfort. As of today, there is no existing solution which provides 3D anatomical context to users during scanning for large organs like liver. Even for routine measurements like liver volume, patients have to undergo CT or MR scan. In this paper, we propose a novel approach to build-patient specific 3D anatomical surface models from B-mode ultrasound images and tracking information from position sensors. The complexity of the problem stems from the fact that liver boundaries are often not very clear in ultrasound images, in addition to large variability in liver size and shape across patients. Our work uses state-of-the-art deep learning algorithms to detect surface landmarks of liver followed by registering a geometric model to surface point cloud to build patient specific 3D liver model. Further, the developed models will be used to guide users to right lesion locations during the interventional procedure. Our proposed semi-automated workflow ensures the accuracy of the developed models are within acceptable limits for the targeted problem.
|
|
09:15-09:30, Paper WeAT13.4 | |
AEPUS: A Tool for the Automated Extraction of Pennation Angles in Ultrasound Images with Low Signal-To-Noise Ratio for Plane-Wave Imaging |
|
Vostrikov, Sergei | ETH Zurich |
Cossettini, Andrea | ETH Zurich |
Leitner, Christoph | Graz University of Technology |
Baumgartner, Christian | Graz University of Technology |
Benini, Luca | University of Bologna |
Keywords: Ultrasound imaging - Other organs, Image feature extraction
Abstract: The penetrating ability of ultrasound (US) combined with its real-time operation make it the perfect tool for investigating muscle contraction mechanics during complex functional tasks, e.g., locomotion. Changes in fascicle lengths and pennation angles of muscle fascicles strongly correlate with the capacity of skeletal muscles to produce forces, thereby represent fundamental parameters to be tracked. While the gold standard for extracting these features from US images is still based on manual annotation, the availability of recording devices capable of generating big data of muscle dynamics makes such manual approach unfeasible, setting the need for automated muscle images annotation tools. Existing approaches, however, are seriously limited, also in view of the continuous developments and technology advancements for ultrafast US and plane-wave imaging. In fact, they rely on conventional (slow) B-mode imaging, make use of point tracking approaches (which often fail due to out-of-plane motion), or can only operate on very high quality images. To overcome all these limitations, we present AEPUS, an automated image labeling tool capable of extracting pennation angles from low quality images using a very small number of plane waves, therefore making it capable of exploiting all the benefits of ultrafast US. Clinical relevance — Ultrasound is a standard research tool to investigate alterations of spastic muscles in children with Cerebral Palsy. We propose a reliable and time-efficient method to track muscle features in ultrasound images and support clinical biomechanists in their analyses
|
|
09:30-09:45, Paper WeAT13.5 | |
Detection of COVID-19 in Point of Care Lung Ultrasound |
|
Maximino, Joana | Faculdade De Ciências, Universidade Do Porto |
Coimbra, Miguel | INESC TEC / Universidade Do Porto |
Pedrosa, João | INESC TEC |
Keywords: Ultrasound imaging - Other organs, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: The coronavirus disease 2019 (COVID-19) evolved into a global pandemic, responsible for a significant number of infections and deaths. In this scenario, point-of-care ultrasound (POCUS) has emerged as a viable and safe imaging alternative. Computer vision (CV) solutions have been proposed to aid clinicians in POCUS image interpretation, namely detection/segmentation of structures and image/patient classification but relevant challenges still remain. As such, the aim of this study is to develop CV algorithms, using Deep Learning techniques, to create tools that can aid doctors in the diagnosis of viral (COVID-19) and bacterial pneumonia (BP) through POCUS exams. To do so Convolutional Neural Networks (CNNs) were designed to perform in classification tasks. The architectures chosen to build these models were the VGG16, ResNet50, DenseNet169 e MobileNetV2. Patients images were divided in three classes: healthy (HE), BP and viral pneumonia (VP) which falls under COVID-19. Through a comparative study, which was based on several performance metrics, the model based on the DenseNet169 architecture was designated as the best performing model, achieving 78% average accuracy value of the five iterations of 5-Fold Cross-Validation (5FCV). Given that the currently available POCUS datasets for COVID-19 are still limited, the training of the models was negatively affected by such and the models were not tested in an independent dataset. Furthermore, it was also not possible to perform lesion detection tasks. Nonetheless, in order to provide explainability and understanding of the models, Gradient-weighted Class Activation Mapping (GradCAM) were used as a tool to highlight the most relevant classification regions.
|
|
09:45-10:00, Paper WeAT13.6 | |
Multi-Level Classification of Lung Pathologies in Neonates Using Recurrence Features |
|
Aujla, Sagarjit | Ryerson University |
Mohamed, Adel | Mount Sinai, University of Toronto |
Khan, Naimul | Ryerson University |
Umapathy, Karthikeyan | Ryerson University |
Keywords: Ultrasound imaging - Other organs, Fetal and Pediatric Imaging, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: The use of Lung Ultrasound (LUS) as a tool to diagnose and monitor lung diseases in neonates has increased in urban hospitals. Lung ultrasound’s main advantages compared to chest CT or X-rays is that it is less expensive, more accessible, and does not expose the patient to radiation. Performing LUS on neonates and diagnosing the LUS images require highly trained specialist medical professional and clinicians. While availability of such specialists in general is not an issue in urban areas, there is lack of such personnel in rural and remote communities. Hence, an automated computer-aided screening approach as a first level diagnosis assistance in such scenarios might be of significant value. Many of the image morphologies (or patterns) used by clinicians in diagnosing the LUS have strong recurrence characteristics. Building upon this knowledge, in this paper, we propose a feature extraction method designed to quantify such recurrent features for classification of LUS images into 6 common neonatal lung conditions. These conditions were normal lung, chronic lung disease (CLD), transient tachypnea of the newborn (TTN), pneumothorax (PTX), respiratory distress syndrome (RDS), and consolidation (CON) that could be pneumonia or atelectasis. The proposed method extracts virtual scanlines from the LUS images and converts them into signals. Then using recurrence quantification analysis (RQA), features were extracted and fed to pattern classifiers. Using a simple linear classifier the proposed features can achieve a classification accuracy of 69.3% without clinical features and 77.6% with clinical features.
|
|
WeAT14 |
Clyde Auditorium |
Theme 02. CT Imaging |
Oral Session |
Chair: Navarro, Fernando | TUM |
Co-Chair: Alirezaie, Javad | Ryerson University, Univ of Waterloo |
|
08:30-08:45, Paper WeAT14.1 | |
Unsupervised Domain Adaptation Using Adversarial Learning and Maximum Square Loss for Liver Tumors Detection in Multi-Phase CT Images |
|
JAIN, Rahul Kumar | College of Information Science and Engineering, Ritsumeikan Univ |
SATO, Takahiro | Tiwaki Co. Ltd., Shiga, Japan |
WATASUE, Taro | Tiwaki Co. Ltd., Shiga, Japan |
NAKAGAWA, Tomohiro | Tiwaki Co. Ltd., Kusatsu, Japan |
Iwamoto, Yutaro | Ritsumeikan University |
Han, Xianhua | Ritsumeikan University |
Lin, Lanfen | Zhejiang University |
Hu, Hongjie | Sir Run Run Shaw Hospital |
Ruan, Xiang | Tiwaki Co. Ltd., Shiga, Japan |
Chen, Yen-Wei | Ritsumeikan University |
Keywords: CT imaging, Image reconstruction and enhancement - Machine learning / Deep learning approaches, Machine learning / Deep learning approaches
Abstract: Automatic and efficient liver tumor detection in multi-phase CT images is essential in computer-aided diagnosis of liver tumors. Nowadays, deep learning has been widely used in medical applications. Normally, deep learning-based AI systems need a large quantity of training data, but in the medical field, acquiring sufficient training data with high-quality annotations is a significant challenge. To solve the lack of training data issue, domain adaptation-based methods have recently been developed as a technique to bridge the domain gap across datasets with different feature characteristics and data distributions. This paper presents a domain adaptation-based method for detecting liver tumors in multi-phase CT images. We adopt knowledge for model learning from PV phase images to ART and NC phase images. To minimize the domain gap, we employ an adversarial learning scheme with the maximum square loss for mid-level output feature maps using an anchorless detector. Experiments show that our proposed method performs much better for various CT-phase images than normal training.
|
|
08:45-09:00, Paper WeAT14.2 | |
Boundary Attention U-Net for Kidney and Kidney Tumor Segmentation |
|
Zhao, Zhongchen | Shanghai Jiao Tong University |
Chen, Huai | Shanghai Jiao Tong University |
Li, Jiang | Shanghai Jiao Tong University |
Wang, Lisheng | Shanghai Jiao Tong University |
Keywords: CT imaging, Image segmentation, Machine learning / Deep learning approaches
Abstract: Kidney cancer is one of the common cancers in the world. Automatic segmentation of the kidney and kidney tumor from CT images is of great significance for the therapy treatment of kidney cancer. Due to the diversity of the kidney tumor in terms of location, size, and shape, current methods have limited performance on the tumor segmentation, especially on the boundary. This paper proposes an effective deep neural network with multi-task learning paradigm to improve the boundary segmentation accuracy. The network is an improved U-Net model enhanced by a novel boundary attention mechanism, named boundary attention U-Net (BAU-Net). It consists of a main branch to segment the target regions and an auxiliary branch to generate boundary attention maps to boost the segmentation. Our method is an extension of our original competition method, in which we improve the classical coarse-to-fine framework by using kidney information to segment tumors. Our method is quantitatively evaluated on a public dataset from MICCAI 2021 Kidney and Kidney Tumor Segmentation Challenge (KiTS2021), with mean Dice similarity coefficient (DSC) as 98.04% and 84.09% for the kidney and kidney tumor respectively, outperforming all competitive methods of KiTS2021.
|
|
09:00-09:15, Paper WeAT14.3 | |
A Unified 3D Framework for Organs at Risk Localization and Segmentation for Radation Therapy Planning |
|
Navarro, Fernando | TUM |
Sasahara, Guido | TUM |
Shit, Suprosanna | TUM |
Sekuboyina, Anjany Kumar | Technical University of Munich |
Ezhov, Ivan | TUM |
Peeken, Jan | TUM |
Combs, Stephanie E. | TUM |
Menze, Bjoern | University of Zurich |
Keywords: CT imaging, Image segmentation, Image feature extraction
Abstract: Automatic localization and segmentation of organs-at-risk (OAR) in CT are essential pre-processing steps in medical image analysis tasks, such as radiation therapy planning. For instance, the segmentation of OAR surrounding tumors enables the maximization of radiation to the tumor area without compromising the healthy tissues. However, the current medical workflow requires manual delineation of OAR, which is prone to errors and is annotator-dependent. In this work, we aim to introduce a unified 3D pipeline for OAR localization-segmentation rather than novel localization or segmentation architectures. To the best of our knowledge, our proposed framework fully enables the exploitation of 3D context information inherent in medical imaging. In the first step, a 3D multi-variate regression network predicts organs’ centroids and bounding boxes. Secondly, 3D organ-specific segmentation networks are leveraged to generate a multi-organ segmentation map. Our method achieved an overall Dice score of 0.9260 ± 0.18% on the VISCERAL dataset containing CT scans with varying fields of view and multiple organs.
|
|
09:15-09:30, Paper WeAT14.4 | |
Dilated Convolution ResNet with Boosting Attention Modules and Combined Loss Functions for LDCT Image Denoising |
|
Marcos, Luella | Ryerson University |
Quint, Franz | Hochschule Karlsruhe University of Applied Sciences |
Babyn, Paul | University of Saskatchewan |
Alirezaie, Javad | Ryerson University, Univ of Waterloo |
Keywords: CT imaging, Image enhancement - Denoising, Image reconstruction and enhancement - Machine learning / Deep learning approaches
Abstract: With the increasing concern regarding the radiation exposure of patients undergoing computed tomography (CT) scans, researchers have been using deep learning techniques to improve the quality of denoised low-dose CT (LDCT) images. In this paper, a cascaded dilated residual network (ResNet) with integrated attention modules, specifically spatial- and channel- attention modules, is proposed. Further, an investigation regarding the effectiveness of per-pixel loss, perceptual loss via VGG16-Net, and SSIM loss functions is covered through an ablation experiment. By knowing how these loss functions affect the output denoised images, a combination of the these loss function is then proposed which aims to prevent edge over-smoothing, enhance textural details and finally, preserve structural details on the denoised images. Finally, bench testing was also done by comparing the visual and quantitative results of the proposed model with the state-of-the-art models such as block matching 3D (BM3D), patch-GAN and dilated convolution with edge detection layer (DRL-E-MP) for accuracy.
|
|
09:30-09:45, Paper WeAT14.5 | |
Dual Discriminator-Based Unsupervised Domain Adaptation Using Adversarial Learning for Liver Segmentation on Multiphase CT Images |
|
Ananda, Swathi | RITSUMEIKAN UNIVERSITY |
Iwamoto, Yutaro | Ritsumeikan University |
Han, Xianhua | Ritsumeikan University |
Lin, Lanfen | Zhejiang University |
Hu, Hongjie | Sir Run Run Shaw Hospital |
Chen, Yen-Wei | Ritsumeikan University |
Keywords: CT imaging, Image segmentation, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: Multiphase computed tomography (CT) images are widely used for the diagnosis of liver disease. Since each phase has different contrast enhancement (i.e., different domain), the multiphase CT images should be annotated for all phases to perform liver or tumor segmentation, which is a time-consuming and labor-expensive task. In this paper, we propose a dual discriminator-based unsupervised domain adaptation (DD-UDA) for liver segmentation on multiphase CT images without annotations. Our framework consists of three modules: a task-specific generator and two discriminators. We have performed domain adaptation at two levels: one is at the feature level, and the other is at the output level, to improve accuracy by reducing the difference in distributions between the source and target domains. Experimental results using public data (PV phase only) as the source domain and private multiphase CT data as the target domain show the effectiveness of our proposed DD-UDA method.
|
|
09:45-10:00, Paper WeAT14.6 | |
Cloud-YLung for Non-Small Cell Lung Cancer Histology Classification from 3D Computed Tomography Whole-Lung Scans |
|
Tomassini, Selene | Marche Polytechnic University |
Falcionelli, Nicola | Marche Polytechnic University |
Sernani, Paolo | Marche Polytechnic University |
Sbrollini, Agnese | Università Politecnica Delle Marche |
Morettini, Micaela | Università Politecnica Delle Marche |
Burattini, Laura | Università Politecnica Delle Marche |
Dragoni, Aldo Franco | Marche Polytechnic University |
Keywords: CT imaging, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: Non-Small Cell Lung Cancer (NSCLC) represents up to 85% of all malignant lung nodules. Adenocarcinoma and squamous cell carcinoma account for 90% of all NSCLC histotypes. The standard diagnostic procedure for NSCLC histotype characterization implies cooperation of 3D Computed Tomography (CT), especially in the form of low-dose CT, and lung biopsy. Since lung biopsy is invasive and challenging (especially for deeply-located lung cancers and for those close to blood vessels or airways), there is the necessity to develop non-invasive procedures for NSCLC histology classification. Thus, this study aims to propose Cloud-YLung for NSCLC histology classification directly from 3D CT whole-lung scans. With this aim, data were selected from the openly-accessible NSCLC-Radiomics dataset and a modular pipeline was designed. Automatic feature extraction and classification were accomplished by means of a Convolutional Long Short-Term Memory (ConvLSTM)-based neural network trained from scratch on a scalable GPU cloud service to ensure a machine-independent reproducibility of the entire framework. Results show that Cloud-YLung performs well in discriminating both NSCLC histotypes, achieving a test accuracy of 75% and AUC of 84%. Cloud-YLung is not only lung nodule segmentation free but also the first that makes use of a ConvLSTM-based neural network to automatically extract high-throughput features from 3D CT whole-lung scans and classify them.
|
|
WeBT3 |
Boisdale-1 |
Theme 03. Micro/Nano-Bioengineering; Cellular/Tissue Engineering &
Biomaterials |
Oral Session |
Chair: Puttaswamy, Srinivasu Valagerahally | ULSTER UNIVERSITY |
Co-Chair: Ilyas, Azhar | New York Institute of Technology |
|
10:30-10:45, Paper WeBT3.1 | |
Controlled Biodegradation and Swelling of Strontium-Doped Alginate/Collagen Scaffolds for Bone Tissue Engineering |
|
Khondkar, Shams | New York Institute of Technology |
Tharakan, Shebin | New York Institute of Technology |
Badran, Abdulhadi | New York Institute of Technology |
Hadjiargyrou, Michael | New York Institute of Technology |
Ilyas, Azhar | New York Institute of Technology |
Keywords: Scaffolds in tissue engineering - Patterned 3D, Scaffolds in tissue engineering
Abstract: Treatment for critical size defects (CSDs) in bone often use bone grafts to act as a scaffold to help complete healing. Biological scaffolds require bone extraction from the individual or an outside donor while synthetic grafts mostly suffer from poor degradation kinetics and decreased bioactivity. In this study, we investigated a 3D printed scaffold derived from a novel composite bioink composed of alginate and collagen augmented with varying doses from 2 mg/mL to 20 mg/mL of 1% strontium-calcium polyphosphate (SCPP) to control biodegradability and fluid uptake. Scaffolds with increased SCPP concentrations showed higher particle density, lesser swelling ratio and greater biodegradability indicating that these critically important properties for bone healing are fine-tunable and highly dependent on SCPP dosing. Clinical Relevance— The dosing of 1% SCPP into porous alginate/collagen scaffolds provides adjustable long-term degradation and material properties suitable for potential in vivo CSD applications.
|
|
10:45-11:00, Paper WeBT3.2 | |
Broadband Microwave Electroporation Device for the Analysis of the Influence of Frequency, Temperature and Electrical Field Strength |
|
Paravicini, Markus | Technical University of Darmstadt |
Milden, Manuela | Technical University of Darmstadt |
Pimentel Paes Frank, Laura M. | Technical University of Darmstadt |
Schuessler, Martin | TU Darmstadt |
Cardoso, Cristina | Darmstadt University of Technology |
Jakoby, Rolf | TU Darmstadt |
Hessinger, Carolin | TU Darmstadt |
Keywords: Electromagnetic field effects and cell membrane, Electric fields - Tissue regeneration, Non-viral gene delivery
Abstract: In this paper, a broadband microwave device for cell poration is presented, that enables the analysis of the relation between frequency, electrical field strengths and temperature for a successful cell poration. Electromagnetic-thermal coupled simulations in the frequency range from 1 GHz to 10 GHz show that the device reaches electrical field strengths of 100 V/cm and temperatures lower then 40°C. Electroporation experiments with adherent C2C12 mouse myoblast cells show successful uptake of a anti-histone g-H2A.X nanobody at a frequency of 10 GHz.
|
|
11:00-11:15, Paper WeBT3.3 | |
Direct Laser 3D Printing of Organic Semiconductor Microdevices for Bioelectronics and Biosensors |
|
Dadras-Toussi, Omid | University of Houston |
Raghunathan, VijayKrishna | Novartis Institutes for BioMedical Research |
Majd, Sheereen | University of Houston |
Abidian, Mohammad Reza | University of Houston |
Keywords: Micro- and nano-sensors, Scaffolds in tissue engineering - Patterned 3D, Biomaterials - Chemical and electrochemical sensors
Abstract: Fabrication of conductive and bioactive microdevices has garnered tremendous attention in the emerging biomedical fields, particularly organic bioelectronics and biosensing. Direct laser 3D printing based on two-photon polymerization (TPP) has shown great promise in construction of well-defined and multi-functional microdevices. Herein, we present a novel photosensitive resin for fabrication of highly conductive and bioactive microstructures via TPP. This resin is based on poly(ethylene glycol) diacrylate that is doped with poly (3,4-ethylenedioxythiophene)-poly(styrenesulfonate) (organic semicoductor), and laminin (extracellular matrix protein) or glucose oxidase (biorecognition enzyme). We demonstrate the fabrication of hybrid microelectrodes, bioactive microstructures for cellular adhesion / spreading, and high-performance glucose biosensors.
|
|
11:15-11:30, Paper WeBT3.4 | |
Effect of Functional Electrical Stimulation on Capillary Blood Flow to Muscle |
|
Puttaswamy, Srinivasu Valagerahally | ULSTER UNIVERSITY |
Bhattacharya, Gourav | Ulster University |
Raj, Shasidran | ULSTER UNIVERSITY |
Bhalla, Nikhil | Ulster University |
Lee, Chengkuo | National University of Singapore |
McLaughlin, James | University of Ulster |
Keywords: BioMEMS/NEMS - Tissue engineering and biomaterials, Electric fields - Tissue regeneration, Micro- and nano-sensors
Abstract: Abstract— Functional electrical stimulation (FES) modifies red blood cells (RBC) flux in blood capillaries of muscle. In this work, we aim to investigate changes in the RBC flux in small and large capillaries due to FES using zinc oxide nanowires (ZnO NWs) based electrode at different stimulation parameters. The RBC flux was significantly increased immediately after stimulation, which was evident from decreasing light intensity measured in the region of interest.
|
|
11:30-11:45, Paper WeBT3.5 | |
In Situ Stability Monitoring of Platinum Thin-Film Electrodes for Neural Interfaces in the Presence of Proteins |
|
Doering, Moritz | University of Freiburg |
Kieninger, Jochen | University of Freiburg |
Urban, Gerald A. | University of Freiburg |
Weltin, Andreas | University of Freiburg |
Keywords: Micro- and nano-sensors, Micro- and nano-technology
Abstract: The long-term stability of platinum electrodes is a key factor that determines the life-time of biomedical devices, such as implanted neural interfaces like brain stimulation or recording electrodes, cochlear implants, and biosensors. The downsizing of such devices relies on the usage of microfabricated thin-film electrodes. In order to determine and investigate the causal degradation processes for platinum electrodes, it is essential to use potential-controlled experiments, which allow selectable polarization of the electrode without exceeding the water stability window boundaries. Therefore, the surface processes and redox reactions occurring at the electrode are known at all times. In this study, we present the continuous in situ monitoring of platinum-based thin-film electrodes along their complete life cycle in neutral pH with and without the presence of proteins. The usage of chronoamperometry for electrode aging, monitoring of surface processes and the tracking of analyte redox processes, together with cyclic voltammetry to determine the complete amount of surface charge, allows a reliable quantification of fundamental degradation processes. We found that platinum dissolution is primarily driven by the formation and removal of Pt oxide. Despite the significantly lowered charge transfer, the presence of proteins did not prevent material loss or increase electrode lifetime. These results should be considered when interpreting results from current-controlled methods as typically used for neural interfaces.
|
|
11:45-12:00, Paper WeBT3.6 | |
Effects of a Fluoroscopy Agent on Radio Opacity and Steering Performance of Pressure-Driven Steerable Micro Guidewire |
|
Park, Chan Young | KAIST |
Lee, Doo Yong | KAIST |
Keywords: Micro- and nano-technology, Microfluidic applications
Abstract: This paper presents experimental results of effects of a fluoroscopy agent on the radio opacity and steering performance of the steerable micro guidewire. The guidewire is driven by internal pressure, and made of the silicone polymer mixed with the barium sulfate, BaSO4, masterbatch. Steerable distal tips with different BaSO4 densities up to 30 % are fabricated. The radio opacity is measured by comparing CT (computed tomography) numbers of the steerable distal tips. The steering performance is measured by the bending angle at particular internal pressures while being bent up to 180°. Experiment results show that the radio opacity improves while the bending stiffness decreases as the concentration of the barium sulfate increases.
|
|
WeBT4 |
Boisdale-2 |
Theme 04. Multiscale Modeling of Cells, Tissues, and Organs |
Oral Session |
Chair: Dokos, Socrates | University of New South Wales |
|
10:30-10:45, Paper WeBT4.1 | |
Model-Based Investigation of Elasticity and Spectral Exponent from Atomic Force Microscopy and Electrophysiology in Normal versus Schizophrenia Human Cerebral Organoids |
|
Dutta, Anirban | University at Buffalo SUNY |
Biber, John | University at Buffalo SUNY |
Bae, Yongho | University at Buffalo SUNY |
Augustyniak, Justyna | University at Buffalo SUNY |
Liput, Michal | University at Buffalo SUNY |
Stachowiak, Ewa | University at Buffalo SUNY |
Stachowiak, Michal | University at Buffalo SUNY |
Keywords: Modeling of cell, tissue, and regenerative medicine - 2d and 3d cell modeling, Modeling of cell, tissue, and regenerative medicine - Tissue profiling , Modeling of cell, tissue, and regenerative medicine - Cells
Abstract: The physiological origin of the aperiodic signal present in the electrophysiological recordings, called 1/f neural noise, is unknown; nevertheless, it has been associated with health and disease. The power spectrum slope, - in 1/f, has been postulated to be related to the dynamic balance between excitation (E) and inhibition (I). Our study found that human cerebral organoids grown from induced pluripotent stem cells (iPSCs) from Schizophrenia patients (SCZ) showed structural changes associated with altered elasticity compared to that of the normal cerebral organoids. Furthermore, mitochondrial drugs modulated the elasticity in SCZ which was found related to the changes in the spectral exponent. Therefore, we developed an electro-mechanical model that related the microtubular-actin tensegrity structure to the elasticity and the 1/f noise. Model-based analysis showed that a decrease in the number and length of the constitutive elements in the tensegrity structure decreased its elasticity and made the spectral exponent more negative while thermal white noise will make . Based on the microtubular-actin model and the cross-talk in structural (elasticity) and functional (electrophysiology) response, aberrant mitochondrial dynamics in SCZ are postulated to be related to the deficits in mitochondrial-cytoskeletal interactions for long-range transport of mitochondria to support synaptic activity for E/I balance.
|
|
10:45-11:00, Paper WeBT4.2 | |
Prediction of the Atherosclerotic Plaque Development in Carotid Arteries; the Effect of T-Cells |
|
Pleouras, Dimitrios S. | Research Comittee of the University of Ioannina, GR 45110 Ioanni |
Mantzaris, Michalis | Unit of Medical Technology and Intelligent Information Systems, |
Siogkas, Panagiotis | FORTH-IMBB |
Tsakanikas, Vasilis D. | University of Ioannina |
Potsika, Vassiliki | Unit of Medical Technology and Intelligent Information Systems, |
Sakellarios, Antonis | Forth-Biomedical Research Institute |
Tsompou, Panagiota | Unit of Medical Technology and Intelligent Information Systems, |
Sigala, Fragiska | First Propedeutic Department of Surgery, National and Kapodistri |
Fotiadis, Dimitrios I. | University of Ioannina |
Keywords: Computational modeling - Analysis of high-throughput systems biology data
Abstract: The carotid artery disease is one of the leading causes of mortality worldwide, as it leads to the progressive arterial stenosis that may result to stroke. To address this issue, the scientific community is attempting not only to enrich our knowledge on the underlying atherosclerotic mechanisms, but also to enable the prediction of the atherosclerotic progression. This study investigates the role of T-cells in the atherosclerotic plaque growth process through the implementation of a computational model in realistic geometries of carotid arteries. T-cells mediate in the inflammatory process by secreting interferon-γ that enhances the activation of macrophages. In this analysis, we used 5 realistic human carotid arterial segments as input to the model. In particular, magnetic resonance imaging data, as well as, clinical data were collected from the patients at two time points. Using the baseline data, plaque growth was predicted and correlated to the follow-up arterial geometries. The results exhibited a very good agreement between them, presenting a high coefficient of determination R2=0.64.
|
|
11:00-11:15, Paper WeBT4.3 | |
Reconstruction of the Gastro-Esophageal Junction Based on Ultramill Imaging for Biomechanical Analysis |
|
Xu, Jack | The University of Auckland |
Cheng, Leo K | The University of Auckland |
Avci, Recep | The University of Auckland |
Du, Peng | The University of Auckland |
Keywords: Organ modeling, Models of organ physiology
Abstract: The gastro-esophageal junction (GEJ) regulates the entry of food into the stomach and prevents reflux of acidic gastric contents into the lower esophagus. This is achieved through multiple mechanisms and the maintenance of a localized high-pressure zone. Diseases of the GEJ typically involve impairments to its muscular functions and often are accompanied by symptoms of reflux, heartburn, and dysphagia. This study aimed to develop a finite element-based model from a unique human cadaver GEJ data reconstructed from an ultramill imaging setup. A pipeline was developed to generate a mesh from an input stack of images. The anatomy of the model was compared to an existing Visible Human finite element GEJ model. Biomechanical simulations were also performed on both models using loading steps of differing levels of calcium to model different levels of contraction. It was found that the ultramill GEJ model is shorter than the Visible Human model (31 vs 48.3 mm), as well as producing lower pressure (1.35 vs 4.36 kPa). The model will be used to investigate detailed pressure development in the GEJ during swallowing under realistic loading conditions.
|
|
11:15-11:30, Paper WeBT4.4 | |
Monte Carlo Simulation of the Effect of Human Skin Melanin in Light-Tissue Interactions |
|
Al-Halawani, Raghda | Research Centre for Biomedical Engineering, City University of L |
Chatterjee, Subhasri | City, University of London |
Kyriacou, Panayiotis | City University London |
Keywords: Data-driven modeling
Abstract: Recent reports have highlighted the potential challenges skin pigmentation can have in the accurate estimation of arterial oxygen saturation when using a pulse oximeter. Pulse oximeters work on the principle of photoplethysmography (PPG), an optical technique used for the assessment of volumetric changes in vascular tissue. The primary aim of this research is to investigate the effect of melanin on tissue when utilising the technique of PPG. To address this, a Monte Carlo (MC) light-tissue interaction model is presented to explore the behaviour of melanin in the visible range in the epidermis. A key novelty in this paper is the ability to model the Modified Beer Lambert Law (MBLL) through a fully functional three-dimensional (3D) model in reflective optical geometry. Maximum photon penetration depth was achieved by red light, however limited bio-optical information was retrieved by moderately and darkly pigmented skin at source-detector separations of less than 3 mm. The current MC model can be modified to provide a more realistic representation of absorption and scattering processes in skin.
|
|
11:30-11:45, Paper WeBT4.5 | |
From Real-Time Single to Multicompartmental Hodgkin-Huxley Neurons on FPGA for Bio-Hybrid Systems |
|
Beaubois, Romain | IMS Laboratory, UMR5218, University of Bordeaux |
Khoyratee, Farad | IMS Laboratory, UMR5218, University of Bordeaux |
Branchereau, Pascal | INCIA, UMR5287, CNRS, University of Bordeaux |
Ikeuchi, Yoshiho | Institute of Industrial Science (IIS), the University of Tokyo ; |
Levi, Timothee | University of Bordeaux |
Keywords: Modeling of cell, tissue, and regenerative medicine - 2d and 3d cell modeling, Modeling of cell, tissue, and regenerative medicine - Cells, Computational modeling - Biological networks
Abstract: Modeling biological neural networks has been a field opening to major advances in our understanding of the mechanisms governing the functioning of the brain in normal and pathological conditions. The emergence of real-time neuromorphic platforms has been leading to a rising significance of bio-hybrid experiments as part of the development of neuromorphic biomedical devices such as neuroprosthesis. To provide a new tool for the neurological disorder characterization, we design real-time single and multicompartmental Hodgkin-Huxley neurons on FPGA. These neurons allow biological neural network emulation featuring improved accuracy through compartment modeling and show integration in bio-hybrid system thanks to its real-time dynamics.
|
|
11:45-12:00, Paper WeBT4.6 | |
An Open-Source Computational Model of Neurostimulation of the Spinal Pudendo-Vesical Reflex for the Recovery of Bladder Control after Spinal Cord Injury |
|
Fang, Xiaoqi | University of Pittsburgh |
Collins, Scott | University of Pittsburgh |
Jantz, Maria | University of Pittsburgh |
Nanivadekar, Ameya | University of Pittsburgh |
Gaunt, Robert | University of Pittsburgh |
Capogrosso, Marco | University of Pittsburgh |
Keywords: Computational modeling - Biological networks, Models of organ physiology, Model building - Sensitivity analysis
Abstract: Spinal cord stimulation (SCS) could be used to restore control of the bladder after spinal cord injury, but substantial development is still required to tailor this technology for bladder function. Computational models could be utilized to accelerate these efforts enabling in-silico optimization of stimulation parameters. However, no model of the spinal pudendo-vesical reflex can simulate the effect of stimulation amplitude on neuron recruitment. This limitation hinders accurate prediction of bladder pressure changes for different stimulation configurations. Here, we implemented an open-source realistic spiking neural network model of the pudendo-vesical reflex enabling exploration of the impact of stimulation amplitude and frequency on bladder pressure changes. We used the o2S2PARC platform to design a parallel implementation of the bladder reflex circuits with NEURON. Our model successfully reproduced and expanded previous studies, producing a decrease in bladder pressure at low stimulation frequency (10 Hz) and excitation at high stimulation frequency (≥33 Hz) in isovolumetric experiments. We then explored the effect of mixed nerve recruitment, simulating a common case of poorly selective spinal cord stimulation. We found that high recruitments of pudendal nerve axons are necessary to maintain this bi-modal behavior, regardless of stimulation specificity. Our framework is fully open-source and can be used to simulate any type of axon stimulations such as SCS and peripheral nerve stimulation.
|
|
WeBT5 |
Carron -1 |
Theme 10. General and Theoretical Informatics II |
Oral Session |
Chair: Koutsouris, Dimitrios | Biomedical Engineering Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens |
Co-Chair: Kim, Jeonghee | Texas A&M University |
|
10:30-10:45, Paper WeBT5.1 | |
Encoding Cardiopulmonary Exercise Testing Time Series As Images for Classification Using Convolutional Neural Network |
|
Sharma, Yash | University of Virginia |
Coronato, Nicholas | University of Virginia |
Brown, Donald | University of Virginia |
Keywords: General and theoretical informatics - Machine learning, Sensor Informatics - Wearable systems and sensors, General and theoretical informatics - Supervised learning method
Abstract: Exercise testing has been available for more than a half-century and is a remarkably versatile tool for diagnostic and prognostic information of patients for a range of diseases, especially cardiovascular and pulmonary. With rapid advancements in technology, wearables, and learning algorithm in the last decade, its scope has evolved. Specifically, Cardiopulmonary exercise testing (CPX) is one of the most commonly used laboratory tests for objective evaluation of exercise capacity and performance levels in patients. CPX provides a non-invasive, integrative assessment of the pulmonary, cardiovascular, and skeletal muscle systems involving the measurement of gas exchanges. However, its assessment is challenging, requiring the individual to process multiple time series data points, leading to simplification to peak values and slopes. But this simplification can discard the valuable trend information present in these time series. In this work, we encode the time series as images using the Gramian Angular Field and Markov Transition Field and use it with a convolutional neural network and attention pooling approach for the classification of heart failure and metabolic syndrome patients. Using GradCAMs, we highlight the discriminative features identified by the model.
|
|
10:45-11:00, Paper WeBT5.2 | |
Improving the Factual Accuracy of Abstractive Clinical Text Summarization Using Multi-Objective Optimization |
|
Alambo, Amanuel | Wright State University |
Banerjee, Tanvi | Wright State University |
Thirunarayan, Krishnaprasad | Wright State University |
Cajita, Mia | University of Illinois, Chicago |
Keywords: General and theoretical informatics - Natural language processing, General and theoretical informatics - Supervised learning method, General and theoretical informatics - Big data analytics
Abstract: While there has been recent progress in abstractive summarization as applied to different domains including news articles, scientific articles, and blog posts, the application of these techniques to clinical text summarization has been limited.This is primarily due to the lack of large-scale training data and the messy/unstructured nature of clinical notes as opposed to other domains where massive training data come in structured or semi-structured form. Further, one of the least explored and critical components of clinical text summarization is factual accuracy of clinical summaries. This is specifically crucial in the healthcare domain, cardiology in particular, where an accurate summary generation that preserves the facts in the source notes is critical to the well-being of a patient. In this study, we propose a framework for improving the factual accuracy of abstractive summarization of clinical text using knowledge-guided multi-objective optimization. We propose to jointly optimize three cost functions in our proposed architecture during training: generative loss, entity loss and knowledge loss and evaluate the proposed architecture on 1) clinical notes of patients with heart failure (HF), which we collect for this study; and 2) two benchmark datasets, Indiana University Chest X-ray collection (IU X-Ray), and MIMIC-CXR, that are publicly available. We experiment with three transformer encoder-decoder architectures and demonstrate that optimizing different loss functions leads to improved performance in terms of entity-level factual accuracy.
|
|
11:00-11:15, Paper WeBT5.3 | |
Cerebral Palsy Prediction with Frequency Attention Informed Graph Convolutional Networks |
|
Zhang, Haozheng | Durham University |
Shum, Hubert P. H. | Durham University |
Ho, Edmond S. L. | Northumbria University |
Keywords: General and theoretical informatics - Pattern recognition, General and theoretical informatics - Machine learning, General and theoretical informatics - Deep learning and big data to knowledge
Abstract: Early diagnosis and intervention are clinically considered the paramount part of treating cerebral palsy (CP), so it is essential to design an efficient and interpretable automatic prediction system for CP. We highlight a significant difference between CP infants' frequency of human movement and that of the healthy group, which improves prediction performance. However, the existing deep learning-based methods did not use the frequency information of infants' movement for CP prediction. This paper proposes a frequency attention informed graph convolutional network and validates it on two consumer-grade RGB video datasets, namely MINI-RGBD and RVI-38 datasets. Our proposed frequency attention module aids in improving both classification performance and system interpretability. In addition, we design a frequency-binning method that retains the critical frequency of the human joint position data while filtering the noise. Our prediction performance achieves state-of-the-art research on both datasets. Our work demonstrates the effectiveness of frequency information in supporting the prediction of CP non-intrusively and provides a way for supporting the early diagnosis of CP in the resource-limited regions where the clinical resources are not abundant.
|
|
11:15-11:30, Paper WeBT5.4 | |
Volumetric Measurements Improve the Accuracy of Aortic Remodeling Prediction in Aortic Dissection |
|
Feng, Hanying | Shanghai Jiao Tong University |
Fu, Zheng | Fudan University |
Wang, Yulin | Fudan University |
Lai, Hao | Fudan University |
Zhang, Puming | Shanghai Jiao Tong University |
Keywords: General and theoretical informatics - Predictive analytics, General and theoretical informatics - Machine learning
Abstract: Accessing aortic remodeling status through regular follow-ups is essential for acute type A aortic dissection patients undergone surgical treatment. Aortic remodeling status was usually determined using diameter or area measurements of the true and false lumen in specific anatomical slices of medical images. However, these indicators only represent partial information about the aorta and can hardly characterize the overall aorta situation. In this study, we included two types of morphology features collected from computed tomography angiography images to predict the aortic remodeling. One type is the volumetric measurements of the true and false lumen, which provide a better overall description of the aorta, and the other type is the volumetric measurements of the thrombus in false lumen and the patent false lumen, which present more detailed information of the dissection. Through progressively incorporating these measurements into the construction of the remodeling prediction model, we investigated the importance of the features that describe the overall situation and that characterize aortic internal details in remodeling prediction, especially the effect of quantitative thrombosis features. The results showed that with the inclusion of the two types of volume features, the prediction accuracy of the model increased, which proves that volumetric measurements of aortic dissection, especially the volume of thrombus, are of significant value in aortic remodeling prediction, and should be paid more attention on in clinical practice and research areas.
|
|
11:30-11:45, Paper WeBT5.5 | |
Electrophysiological Differences in Distinct Hearing Threshold Level Individuals with and without Tinnitus Distress |
|
Manta, Ourania | Biomedical Engineering Laboratory, NTUA |
Sarafidis, Michail | National Technical University of Athens |
Schlee, Winfried | University Hospital Regensburg |
Consoulas, Christos | Laboratory of Experimental Physiology, Medical School, National |
Kikidis, Dimitris | National and Kapodistrian University of Athens |
Koutsouris, Dimitrios | Biomedical Engineering Laboratory, School of Electrical and Comp |
Keywords: General and theoretical informatics - Statistical data analysis, Health Informatics - Disease profiling and personalized treatment, Health Informatics - Informatics for chronic disease management
Abstract: Tinnitus is the perception of sound when no actual external noise is present. Tinnitus is highly prevalent, with more than 1 in 7 adults in the EU having tinnitus, and it causes negative effects on quality of life for many individuals. However, there is currently no cure for tinnitus and its pathophysiology and genesis are unknown. Auditory evoked potentials (AEPs) provide a non-invasive means by which the electrical signals evoked by the brain can be recorded, and constitute a useful indicator for the evaluation of auditory disorders such as tinnitus and hearing loss. The present study analyzed a total of 98 auditory middle evoked potential (AMLR) waveforms, a subtype of AEPs, from 49 participants with subjective tinnitus, attempting to identify differences in AMLR parameters between sufferers with and without tinnitus distress. The waveforms were divided into three categories according to the ear’s hearing level, and comparisons were made between sufferers in the same hearing level category. The results of the analysis indicated some statistically significant differences in AMLR latencies and amplitudes between the compared groups.
|
|
11:45-12:00, Paper WeBT5.6 | |
Analytics Pipeline for Visualization of Single Cell RNA Sequencing Data from Brochoaveolar Fluid in COVID-19 Patients: Assessment of Neuro-Fuzzy C-Means and HDBSCAN |
|
Gare, Suman | Indian Institute of Technology Hyderabad |
Chel, Soumita | NA |
Pantula, Priyanka | Indian Institute of Technology, Hyderabad |
Saxena, Abha | Indian Institute of Technology Hyderabad, |
Mitra, Kishalay | Indian Institute of Technology Hyderabad |
Sarkar, Rahuldeb | Medway NHS Foundation Trust |
Giri, Lopamudra | Indian Institute of Technology Hyderabad |
Keywords: General and theoretical informatics - Unsupervised learning method, Bioinformatics - Bioinformatics for health monitoring, Health Informatics - Behavioral health informatics
Abstract: Since the mutation in SARS-COV2 poses new challenges in designing vaccines, it is imperative to develop advanced tools for visualizing the genetic information. Specially, it remains challenging to address the patient-to-patient variability and control severe conditions. In this endeavor we analyze the large-scale RNA-sequencing data collected from bronchoalveolar fluid. In this work, we have used PCA and tSNE for the dimension-reduction. The novelty of the current work is to depict a detailed comparison of k-means, HDBSAN and neuro-fuzzy method in visualization of high-dimension data on gene expression. Clinical Relevance— The subpopulation profiling can be used to study the patient-to patient variability when infected by SARS-COV-2 and its variants. The distribution of cell types can be relevant in designing new drugs that are targeted to control the distribution of epithelial cells, T cells and macrophages.
|
|
WeBT6 |
Carron-2 |
Theme 10. General and Theoretical Informatics - Deep Learning |
Oral Session |
Chair: Ellis, Charles | Georgia Institute of Technology |
Co-Chair: Lu, Haiping | University of Sheffield |
|
10:30-10:45, Paper WeBT6.1 | |
Multi-Agent Feature Selection for Integrative Multi-Omics Analysis |
|
Tabakhi, Sina | University of Sheffield |
Lu, Haiping | University of Sheffield |
Keywords: Bioinformatics - High throughput –omics (genomics, proteomics, metabolomics, lipidomics, and metagenomics) data analytics for precision health, General and theoretical informatics - Machine learning
Abstract: Multi-omics data integration is key for cancer prediction as it captures different aspects of molecular mechanisms. Nevertheless, the high-dimensionality of multi-omics data with a relatively small number of patients presents a challenge for the cancer prediction tasks. While feature selection techniques have been widely used to tackle the curse of dimensionality of multi-omics data, most existing methods have been applied to each type of omics data separately. In this paper, we propose a multi-agent architecture for feature selection, called MAgentOmics, to consider all omics data together. MAgentOmics extends the ant colony optimization algorithm to multi-omics data, which iteratively builds candidate solutions and evaluates them. Moreover, a new fitness function is introduced to assess the candidate feature subsets without using prediction target such as survival time of patients. Therefore, it can be considered as an unsupervised method. We evaluate the performance of MAgentOmics on the TCGA ovarian cancer multi-omics data from 176 patients using a 5-fold cross-validation. The results demonstrate that the integration power of MAgentOmics is relatively better than the state-of-the-art supervised multi-view method. The code is publicly available at https://github.com/SinaTabakhi/MAgentOmics.
|
|
10:45-11:00, Paper WeBT6.2 | |
A Model Visualization-Based Approach for Insight into Waveforms and Spectra Learned by CNNs |
|
Ellis, Charles | Georgia Institute of Technology |
Miller, Robyn | The Tri-Institutional Center for Translational Neuroimaging And |
Calhoun, Vince D | Tri-Institutional Center for Translational Research in Neuroimag |
Keywords: General and theoretical informatics - Deep learning and big data to knowledge, General and theoretical informatics - Machine learning, General and theoretical informatics - Artificial Intelligence
Abstract: Recent years have shown a growth in the application of deep learning architectures such as convolutional neural networks (CNNs), to electrophysiology analysis. However, using neural networks with raw time-series data makes explainability a significant challenge. Multiple explainability approaches have been developed for insight into the spectral features learned by CNNs from EEG. However, across electrophysiology modalities, and even within EEG, there are many unique waveforms of clinical relevance. Existing methods that provide insight into waveforms learned by CNNs are of questionable utility. In this study, we present a novel model visualization-based approach that analyzes the filters in the first convolutional layer of the network. To our knowledge, this is the first method focused on extracting explainable information from EEG waveforms learned by CNNs while also providing insight into the learned spectral features. We demonstrate the viability of our approach within the context of automated sleep stage classification, a well-characterized domain that can help validate our approach. We identify 3 subgroups of filters with distinct spectral properties, determine the relative importance of each group of filters, and identify several unique waveforms learned by the classifier that were vital to the classifier performance. Our approach represents a significant step forward in explainability for electrophysiology classifiers, which we also hope will be useful for providing insights in future studies.
|
|
11:00-11:15, Paper WeBT6.3 | |
CycleDNN - a Novel Deep Neural Network Model for CETSA Feature Prediction Cross Cell Lines |
|
Zeng, Zeng | Institute for Infocomm Research (I2R), Agency for Science, Techn |
Zhao, Shenghao | National University of Singapore |
Da, Qing | National University of Singapore |
Qian, Peisheng | Institute for Infocomm Research (I2R), Agency for Science |
Tam, Wai Leong | Genome Institute of Singapore |
Dai, Lingyun | The First Affiliated Hospital of Southern University of Science |
Nordlund, Pär | Karolinska Institutet, Stockholm |
Prabhu, Nayana | Imcb, A-Star |
Zhao, Ziyuan | Institute for Infocomm Research (I2R), Agency for Science, Techn |
Yang, Xulei | Institute for Infocomm Research, A*STAT |
Keywords: General and theoretical informatics - Deep learning and big data to knowledge, Bioinformatics - Bioinformatics databases, General and theoretical informatics - Big data analytics
Abstract: Cellular Thermal Shift Assay (CETSA) has been widely used in drug discovery, cancer cell biology, immunology, etc. One of the barriers for CETSA applications is that CETSA experiments have to be conducted on various cell lines, which is extremely time-consuming and costly. In this study, we make an effort to explore the translation of CETSA features cross cell lines, i.e., known CETSA feature of a given protein in one cell line, can we automatically predict the CETSA feature of this protein in another cell line, and vice versa? Inspired by pix2pix and CycleGAN, which perform well on image-toimage translation cross various domains in computer vision, we propose a novel deep neural network model called CycleDNN for CETSA feature translation cross cell lines. Given cell lines A and B, the proposed CycleDNN consists of two auto-encoders, the first one encodes the CETSA feature from cell line A into Z in the latent space Z, then decodes Z into the CETSA feature in cell line B. Similarly, the second one translates the CETSA feature from cell line B to cell line A through the latent space Z ′ . In such a way, the two auto-encoders form a cyclic feature translation between cell lines. The reconstructed loss, cycleconsistency loss, and latent vector regularization loss are used to guide the training of the model. The experimental results on a public CETSA dataset demonstrate the effectiveness of the proposed approach.
|
|
11:15-11:30, Paper WeBT6.4 | |
DeepPulse: An Uncertainty-Aware Deep Neural Network for Heart Rate Estimations from Wrist-Worn Photoplethysmography |
|
Ray, Daniel | Manchester Metropolitan University |
Collins, Tim | Manchester Metropolitan University |
Ponnapalli, Prasad V. S. | Manchester Metropolitan University |
Keywords: General and theoretical informatics - Deep learning and big data to knowledge, Sensor Informatics - Multi-sensor data fusion, Health Informatics - Quality of service, trust, security
Abstract: Wearable Photoplethysmography (PPG) has gained prominence as a low cost, unobtrusive and continuous method for physiological monitoring. The quality of the collected PPG signals is affected by several sources of interference, predominantly due to physical motion. Many methods for estimating heart rate (HR) from PPG signals have been proposed with Deep Neural Networks (DNNs) gaining popularity in recent years. However, the “black-box” and complex nature of DNNs has caused a lack of trust in the predicted values. This paper contributes DeepPulse, an uncertainty-aware DNN method for estimating HR from PPG and accelerometer signals, with aims of increasing the reliability, usability and interpretability of the predicted HR values. To the best of the authors’ knowledge no PPG HR estimation method has considered aleatoric and epistemic uncertainty metrics. The results show DeepPulse is the most accurate method for DNNs with smaller network sizes. Finally, recommendations are given to reduce epistemic uncertainty, validate uncertainty estimates, improve the accuracy of DeepPulse as well as reduce the model size for resource-constrained edge devices.
|
|
11:30-11:45, Paper WeBT6.5 | |
LTH-ECG: Lottery Ticket Hypothesis-Based Deep Learning Model Compression for Atrial Fibrillation Detection from Single Lead ECG on Wearable and Implantable Devices |
|
Sahu, Ishan | Tata Consultancy Services |
Ukil, Arijit | TATA Consultancy Services |
khandelwal, sundeep | Tata Consultancy Services |
Pal, Arpan | Tata Consultancy Services |
Keywords: General and theoretical informatics - Deep learning and big data to knowledge, Health Informatics - internet of things in healthcare, Sensor Informatics - Wearable systems and sensors
Abstract: Atrial Fibrillation (AF) is a kind of arrhythmia, which is a major morbidity factor, and AF can lead to stroke, heart failure and other cardiovascular complications. Electrocardiogram (ECG) is the basic marker to test the condition of heart and it can effectively detect AF condition. Single lead ECG has the practical advantage for being small form factor and it is easy to deploy. With the sophistication of the current deep learning (DL) models, researchers have been able to construct cardiologist-level models to detect different arrhythmias including AF condition detection from single lead short-time ECG signals. However, such models are computationally expensive and require huge memory size for deployment (more than 100 MB to deploy state-of-the-art 34-layer convolutional neural network-based ECG classification model). Such models need to be significantly trimmed with insignificantly loss of its classification performance for deployment in practical applications like single lead ECG classification in wearable and implantable devices. We have found that classical deep learning model compression techniques like pruning, quantization are not capable of substantial model size reduction without compromising on the model performance. In this paper, we propose LTH-ECG, which is our novel goal-driven winning lottery ticket discovery method, where lottery ticket hypothesis (LTH)-based iterative model pruning is used with the aim of over-pruning avoidance. LTH-ECG reduces the model size by 142x times with insignificant loss of classification performance (less than 1% test F1-score penalty).
|
|
11:45-12:00, Paper WeBT6.6 | |
CETSA Feature Based Clustering for Protein Outlier Discovery by Protein-To-Protein Interaction Prediction |
|
Yang, Xulei | Institute for Infocomm Research, A*STAT |
Da, Qing | National University of Singapore |
Qian, Peisheng | Institute for Infocomm Research (I2R), Agency for Science |
Veeravalli, Bharadwaj | National University of Singapore |
Tam, Wai Leong | Genome Institute of Singapore |
Dai, Lingyun | The First Affiliated Hospital of Southern University of Science |
Nordlund, Pär | Karolinska Institutet, Stockholm |
Prabhu, Nayana | Imcb, A-Star |
Zhao, Ziyuan | Institute for Infocomm Research (I2R), Agency for Science, Techn |
Zeng, Zeng | Institute for Infocomm Research (I2R), Agency for Science, Techn |
Keywords: General and theoretical informatics - Machine learning, General and theoretical informatics - Big data analytics, Bioinformatics - Bioinformatics databases
Abstract: The Cellular Thermal Shift Assay (CETSA) is a biophysical assay based on the principle of ligand-induced thermal stabilization of target proteins. This technology has revolutionized cell-based target engagement studies and has been used as guidance for drug design. Although many applications of CETSA data have been explored, the correlations between CETSA data and protein-protein interactions (PPI) have barely been touched. In this study, we conduct the first exploration study applying CETSA data for PPI prediction. We use a machine learning method, Decision Tree, to predict PPI scores using proteins’ CETSA features. It shows promising results that the predicted PPI scores closely match the groundtruth PPI scores. Furthermore, for a small number of protein pairs, whose PPI score predictions mismatch the ground truth, we use iterative clustering strategy to gradually reduce the number of these pairs. At the end of iterative clustering, the remaining protein pairs may have some unusual properties and are of scientific value for further biological investigation. Our study has demonstrated that PPI is a brand-new application of CETSA data. At the same time, it also manifests that CETSA data can be used as a new data source for PPI exploration study.
|
|
WeBT9 |
Gala |
Theme 02. Image Analysis and Classification - Machine Learning / Deep
Learning Approaches - II |
Oral Session |
Chair: Loizidou, Kosmia | University of Cyprus and KIOS Research and Innovation Center of Excellence |
|
10:30-10:45, Paper WeBT9.1 | |
Spatio-Temporal Causal Transformer for Multi-Grained Surgical Phase Recognition |
|
Chen, Huabin | Institute of Automation, Chinese Academy of Sciences |
Li, Zhen | Institute of Automation, Chinese Academy of Sciences |
Fu, Pan | School of Automation, Beijing Information Science and Technology |
Ni, ZhenLiang | Institute of Automation, Chinese Academy of Sciences |
Bian, Gui-Bin | Institute of Automation, Chinese Academy of Sciences |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Optical imaging and microscopy - Microscopy
Abstract: Automatic surgical phase recognition plays a key role in surgical workflow intelligent analysis and overall optimization in clinical work. In the complicated surgical procedures, similar inter-class appearance and drastic variability in phase duration make this still a challenging task. In this paper, a spatio-temporal transformer is proposed for online surgical phase recognition with different granularity. To extract richer spatial information, a spatial transformer is used to model global spatial dependencies of each time index. To overcome the variability in phase duration, a temporal transformer captures the multi-scale temporal context of different time indexes with a dual pyramid pattern. Our method is thoroughly validated on the public Cholec80 dataset with 7 coarse-grained phases and CATARACTS2020 dataset with 19 fine-grained phases, outperforming state-of-the-art approaches with 91.4% and 84.2% accuracy, taking only 24.5M parameters.
|
|
10:45-11:00, Paper WeBT9.2 | |
Identification and Classification of Benign and Malignant Masses Based on Subtraction of Temporally Sequential Digital Mammograms |
|
Loizidou, Kosmia | University of Cyprus and KIOS Research and Innovation Center Of |
Skouroumouni, Galateia | Nicosia General Hospital |
Savvidou, Gabriella | Medical School University of Cyprus, Bank of Cyprus Oncology Cen |
Constantinidou, Anastasia | Medical School University of Cyprus, Bank of Cyprus Oncology Cen |
Nikolaou, Christos | Limassol General Hospital |
Pitris, Costas | University of Cyprus |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, X-ray imaging applications, Image enhancement
Abstract: Breast cancer remains the leading cause of cancer deaths and the second highest cause of death, in general, among women worldwide. Fortunately, over the last few decades, with the introduction of mammography, the mortality rate of breast cancer has significantly decreased. However, accurate classification of breast masses in mammograms is especially challenging. Various Computer-Aided Diagnosis (CAD) systems are being developed to assist radiologists with the accurate classification of breast abnormalities. In this study, classification of benign and malignant masses, based on the subtraction of temporally sequential digital mammograms and machine learning, is proposed. The performance of the algorithm was evaluated on a dataset created for the purposes of this study. In total, 196 images from 49 patients, with precisely annotated mass locations and biopsy confirmed malignant cases, were included. Ninety-six features were extracted and five feature selection algorithms were employed to identify the most important features. Ten classifiers were tested using leave-one-patient-out and 7-fold cross-validation. Neural Networks, achieved the highest classification performance with 90.85% accuracy and 0.91 AUC, an improvement compared to the state-of-the-art. These results demonstrate the effectiveness of the subtraction of temporally consecutive mammograms for the classification of breast masses as benign or malignant.
|
|
11:00-11:15, Paper WeBT9.3 | |
A Non-Aligned Deep Representation to Enhance Standard Colonoscopy Observations from Vascular Narrow Band Polyp Patterns |
|
Sierra-Jerez, Franklin | Biomedical Imaging, Vision and Learning Laboratory (BivL2ab). Un |
Ruiz, Jair | Instituto De Gastroenterología Y Hepatología Del Oriente IGHO S |
Martinez, Fabio | Universidad Industrial De Santander |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image feature extraction, Image classification
Abstract: Colorectal cancer (CRC) was responsible during 2020 for about one million deaths worldwide. Polyps are protuberance masses, observed in routine colonoscopies, that constitute the main CRC biomarker. Nonetheless, one of the best alternatives to the polyp malignancy classification is the vascular pattern analysis, typically observed from specialized narrow-band images (NBI). Even worst, these patterns are only characterized from gastroenterologist observations, introducing subjectivity and being prone to diagnostic errors, with misclassifications ranging from 59.5% to 84.2%. This work introduces a non-aligned and bi-directional deep projection between optical colonoscopy (OC) and NBI sequences, to recover enhanced OC sequences, integrating vascular patterns, that allow better discrimination among adenomas, hyperplastic and serrated polyps. This self-supervised representation help with misclassification in standard OC observations. The validation was performed on a total of 76 OC and 76 NBI sequences, achieving a gain of 22.34% w.r.t descriptors computed from raw OC.
|
|
11:15-11:30, Paper WeBT9.4 | |
Transfer Learning for Automated COVID-19 B-Line Classification in Lung Ultrasound |
|
Pare, Joseph | Boston Medical Center |
Gjesteby, Lars | MIT Lincoln Laboratory |
Telfer, Brian | MIT Lincoln Laboratory |
Tonelli, Melinda | Boston Medical Center |
Leo, Megan | Boston Medical Center |
Billatos, Ehab | Boston Medical Center |
scalera, Jonathan | Boston Medical Center |
Brattain, Laura | MIT Lincoln Laboratory |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification, Ultrasound imaging - Other organs
Abstract: Lung ultrasound (LUS) as a diagnostic tool is gaining support for its role in the diagnosis and management of COVID-19 and a number of other lung pathologies. B-lines are a predominant feature in COVID-19, however LUS requires a skilled clinician to interpret findings. To facilitate the interpretation, our main objective was to develop automated methods to classify B-lines as pathologic vs. normal. We developed transfer learning models based on ResNet networks to classify B-lines as pathologic (at least 3 B-lines per lung field) vs. normal using COVID-19 LUS data. Assessment of B-line severity on a 0-4 multi-class scale was also explored. For binary B-line classification, at the frame-level, all ResNet models pretrained with ImageNet yielded higher performance than the baseline nonpretrained ResNet-18. Pretrained ResNet-18 has the best Equal Error Rate (EER) of 9.1% vs the baseline of 11.9%. At the clip-level, all pretrained network models resulted in better Cohen’s kappa agreement (linear-weighted) and clip score accuracy, with the pretrained ResNet-18 having the best Cohen’s kappa of 0.815 [95% CI: 0.804-0.826], and ResNet-101 the best clip scoring accuracy of 93.6%. Similar results were shown for multi-class scoring, where pretrained network models outperformed the baseline model. A class activation map is also presented to guide clinicians in interpreting LUS findings. Future work aims to further improve the multi-class assessment for severity of B-lines with a more diverse LUS dataset.
|
|
11:30-11:45, Paper WeBT9.5 | |
Multimodal Contrastive Supervised Learning to Classify Clinical Significance MRI Regions on Prostate Cancer |
|
Gutiérrez, Yesid | Biomedical Imaging, Vision and Learning Laboratory (BivL2ab). Un |
Arevalo, John | Universidad Nacional De Colombia |
Martinez, Fabio | Universidad Industrial De Santander |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Multimodal image fusion, Magnetic resonance imaging - Other organs
Abstract: Clinically significant regions (CSR), captured over multi-parametric MRI (mp-MRI) images, have emerged as a potential screening test for early prostate cancer detection and characterization. These sequences are able to quantify morphology, micro-circulation, and cellular density patterns that might be related to cancer disease. Nonetheless, this evaluation is mainly carried out by expert radiologists, introducing inter-reader variability in the diagnosis. Therefore, different deep learning models were proposed to support the diagnosis, but a proper representation of prostate lesions remains limited due to the non-alignment among sequences and the dependency of considerable amounts of labeled data for learning. The main limitation of such representation lies in the cross-entropy minimization that only exploits inter-class variation, being insufficient data augmentation and transfer learning strategies. This work introduces a Supervised Contrastive Learning (SCL) strategy that fully exploits the inter and intra-class variability of prostate lesions to robustly represent MRI regions. This strategy extracts lesion sample tuples, with positive and negative labels, regarding a query lesion. Such tuples are involved into an easy-positive, and semi-hard negative mining to project samples that better update the deep representation. The proposed learning strategy achieved an average ROC-AUC of 0.82, to characterize prostate cancer in MRI, using only the 60% of the available annotated data. newline indent textit{Clinical relevance}— A robust learning scheme that properly finds representations in limited data scenarios to classify clinically significant MRI regions on prostate cancer.
|
|
WeBT10 |
Forth |
Theme 02. Magnetic Resonance Imaging - Cardiac Imaging |
Oral Session |
Chair: Zhao, Bo | University of Texas at Austin |
Co-Chair: Wang, Ze | University of Maryland Baltimore |
|
10:30-10:45, Paper WeBT10.1 | |
Detection and Classification of Myocardial Infarction Transmurality Using Cardiac MR Image Analysis and Machine Learning Algorithms |
|
Hernández-Casillas, Andrea | Universitat Politècnica De València |
Del-Canto, Irene | Universitat Politècnica De València |
Ruiz-España, Silvia | Universitat Politècnica De València |
López-Lereu, María P. | ERESA |
Monmeneu, José V. | ERESA |
Moratal, David | Universitat Politècnica De València |
Keywords: Magnetic resonance imaging - Cardiac imaging, Image feature extraction, Machine learning / Deep learning approaches
Abstract: The presence of abnormalities when the left ventricle is deformed is related to the patients’ prognosis after a first myocardial infarction. These deformations can be detected by performing a cardiac magnetic resonance (CMR) study. Currently, late gadolinium enhancement (LGE) is considered to be the gold standard when performing CMR imaging. However, CMR with LGE overestimates infarct size and underestimates recovery of dysfunctional segments after myocardial infarction. Based on this statement, the objective is to detect, characterize, and quantify the extent of myocardial infarction in patients with cardiac pathologies, using parameters derived from CMR, in order to obtain greater precision in patients’ recovery predictions than when only studying LGE images. For this purpose, we studied the infarct presence and extension from a total of 105 images from 35 patients, and calculated myocardium strain and torsion to characterize and quantify the affected tissue. A total of twenty-one parameters were selected to create predictive models. Moreover, we compared two feature extraction methods, and the performance of five machine learning algorithms. Results show that both temporal and strain parameters are the most relevant to detect and characterize the extent of myocardial infarction. The use of imaging techniques and machine learning algorithms have great potential and show promising results when it comes to detecting the presence and extent of myocardial infarction.
|
|
10:45-11:00, Paper WeBT10.2 | |
Adiabatic Spin-Lock Preparations Enable Robust in Vivo Cardiac T1ρ-Mapping at 3T |
|
Coletti, Chiara | TU Delft |
Tourais, Joao | Delft University of Technology |
Ploem, Telly | Delft University of Technology |
van de Steeg-Henzen, Christal | HollandPTC |
Akcakaya, Mehmet | University of Minnesota |
Weingärtner, Sebastian | Delft University of Technology |
Keywords: Magnetic resonance imaging - Cardiac imaging, Magnetic resonance imaging - Pulse sequence, Cardiac imaging and image analysis
Abstract: Magnetic Resonance Imaging (MRI) is the clinical gold standard for assessment of myocardial viability, but requires injection of exogenous Gadolinium-based contrast agents. Recently, T1ρ-mapping has been proposed as a fully non-invasive alternative for imaging myocardial fibrosis without the need for contrast agent injection. However its applicability at high fields is hindered by susceptibility to MRI system imperfections, such as inhomogeneities in the B0and B+1 fields. In this work we propose a single breath-hold ECG-triggered single-shot bSSFP sequence to enable T1ρ-mapping in-vivo at 3T. Adiabatic T1ρ preparations are evaluated to reduce B0 and B+1 sensitivity in comparison with conventional spin-lock modules. Numerical Bloch simulations were performed to identify optimal parameters for the adiabatic pulses. Experiments yield T1ρ values in the myocardium equal to 148.13 ± 54.08 ms for the best adiabatic preparation and 16.01 ± 20.75 for the reference non-adiabatic spin-lock, with 26.91% against 89.74% relative difference in T1ρ values across two shimming conditions. Both phantom and in vivo measurements show increased myocardium/blood contrast and improved resilience against system imperfections compared with non-adiabatic T1ρ preparations, enabling the use at 3T. Adiabatically-preparedT1ρ-mapping sequences form a promising candidate for non-contrast evaluation of ischemic and non-ischemic cardiomyopathies at 3T.
|
|
11:00-11:15, Paper WeBT10.3 | |
Myocardial Approximate Spin-Lock Dispersion Mapping Using a Simultaneous T2 and TRAFF2 Mapping at 3T MRI |
|
Tourais, Joao | Delft University of Technology |
Demirel, Omer Burak | University of Minnesota |
Tao, Qian | Leiden University Medical Center |
Pierce, Iain | Royal Brompton Hospital |
Thornton, George D | Barts Heart Centre, Barts Health NHS Trust, West Smithfield, Lon |
Treibel, Thomas A | Barts Heart Centre, Barts Health NHS Trust, West Smithfield, Lon |
Akcakaya, Mehmet | University of Minnesota |
Weingärtner, Sebastian | Delft University of Technology |
Keywords: Magnetic resonance imaging - Cardiac imaging, Magnetic resonance imaging - Pulse sequence, Image reconstruction and enhancement - Parametric image reconstruction
Abstract: Ischemic heart disease (IHD) is one of the leading causes of death worldwide. Myocardial infarction (MI) makes up a third of all IHD cases, and cardiac magnetic resonance imaging (MRI) is often used to assess its damage to myocardial viability. Late gadolinium enhancement (LGE) is the current gold standard, but the use of gadolinium-based agents limits the clinical applicability in some patients. Spin-lock (SL) dispersion has recently been proposed as a promising non-contrast biomarker for the assessment of MI. However, at 3T, the required range of SL preparations acquired at different amplitudes suffers from specific absorption rate (SAR) limitations and off-resonance artifacts. Relaxation Along a Fictitious Field (RAFF) is an alternative to SL preparations with lower SAR requirements, while still sampling relaxation in the rotating frame. In this study, a single breath-hold simultaneous TRAFF2 and T2 mapping sequence is proposed for SL dispersion mapping at 3T. Excellent reproducibility (coefficient of variations lower than 10%) was achieved in phantom experiments, indicating good intrascan repeatability. The average myocardial TRAFF2, T2, and SL dispersion obtained with the proposed sequence (68.0 ± 10.7 ms, 44.0 ± 4.0 ms, and 0.4 ± 0.2 x 10-4 s2, respectively) were comparable to the reference methods (62.7 ± 11.7 ms, 41.2 ± 2.4 ms, and 0.3 ± 0.2 x 10-4 s2, respectively). High visual map quality, free of B0 and B1 related artifacts, for T2, TRAFF2, and SL dispersion maps was obtained in phantoms and in vivo, suggesting promise in clinical use at 3T.
|
|
11:15-11:30, Paper WeBT10.4 | |
3D Cardiac Substructures Segmentation from CMRI Using Generative Adversarial Network (GAN) |
|
Kanakatte, Aparna | Tata Consultancy Services |
Bhatia, Divya | TCS-Research and Innovation |
Ghose, Avik | TCS Research & Innovation |
Keywords: Magnetic resonance imaging - Cardiac imaging, Image segmentation, Cardiac imaging and image analysis
Abstract: Cardiac magnetic resonance imaging (CMRI) improves the diagnosis of cardiovascular diseases by providing images at high spatio-temporal resolution helping physicians in providing correct treatment plans. Segmentation and identification of various substructures of the heart at different cardiac phases of end-systole and end-diastole helps in the extraction of ventricular function information such as stroke volume, ejection fraction, myocardium thickness, etc. Manual delineation of the substructures is tedious, time-consuming, and error-prone. We have implemented a 3D GAN that includes 3D contextual information capable of segmenting and identifying the substructures at different cardiac phases with improved accuracy. Our method is evaluated on the ACDC dataset (4 pathologies, 1 healthy group) to show that the proposed method outperforms other methods in literature with less amount of data. Also, the proposed provided a better Dice score in segmentation surpassing other methods on a blind-tested M&Ms dataset.
|
|
11:30-11:45, Paper WeBT10.5 | |
Automated 3D Whole-Heart Mesh Reconstruction from 2D Cine MR Slices Using Statistical Shape Model |
|
Banerjee, Abhirup | University of Oxford |
Zacur, Ernesto | Oxford University |
Choudhury, Robin P. | University of Oxford |
Grau, Vicente | University of Oxford |
Keywords: Magnetic resonance imaging - Cardiac imaging, Iterative image reconstruction, Regularized image Reconstruction
Abstract: Cardiac magnetic resonance (CMR) imaging is the one of the gold standard imaging modalities for the diagnosis and characterization of cardiovascular diseases. The clinical cine protocol of the CMR typically generates high-resolution 2D images of heart tissues in a finite number of separated and independent 2D planes, which are appropriate for the 3D reconstruction of biventricular heart surfaces. However, they are usually inadequate for the whole-heart reconstruction, specifically for both atria. In this regard, the paper presents a novel approach for automated patient-specific 3D whole-heart mesh reconstruction from limited number of 2D cine CMR slices with the help of a statistical shape model (SSM). After extracting the heart contours from 2D cine slices, the SSM is first optimally fitted over the sparse heart contours in 3D space to provide the initial representation of the 3D whole-heart mesh, which is further deformed to minimize the distance from the heart contours for generating the final reconstructed mesh. The reconstruction performance of the proposed approach is evaluated on a cohort of 30 subjects randomly selected from the UK Biobank study, demonstrating the generation of high-quality 3D whole-heart meshes with average contours to surface distance less than the underlying image resolution and the clinical metrics within acceptable ranges reported in previous literature.
|
|
11:45-12:00, Paper WeBT10.6 | |
A 3D Convolutional Neural Network with Gradient Guidance for Image Super-Resolution of Late Gadolinium Enhanced Cardiac MRI |
|
Upendra, Roshan Reddy | Rochester Institute of Technology |
Linte, Cristian A. | Rochester Institute of Technology |
Keywords: Magnetic resonance imaging - Cardiac imaging, Image reconstruction and enhancement - Machine learning / Deep learning approaches
Abstract: In this paper, we describe a 3D convolutional neural network (CNN) framework to compute and generate super-resolution late gadolinium enhanced (LGE) cardiac magnetic resonance imaging (MRI) images. The proposed CNN framework consists of two branches: a super-resolution branch with a 3D dense deep back-projection network (DBPN) as the backbone to learn the mapping of low-resolution LGE cardiac volumes to high-resolution LGE cardiac volumes, and a gradient branch that learns the mapping of the gradient map of low resolution LGE cardiac volumes to the gradient map of their high-resolution counterparts. The gradient branch of the CNN provides additional cardiac structure information to the super-resolution branch to generate structurally more accurate super-resolution LGE MRI images. We conducted our experiments on the 2018 atrial segmentation challenge dataset. The proposed CNN framework achieved a mean peak signal-to-noise ratio (PSNR) of 30.91 and 25.66 and a mean structural similarity index measure (SSIM) of 0.91 and 0.75 on training the model on low-resolution images downsampled by a scale factor of 2 and 4, respectively.
|
|
WeBT12 |
M1 |
Theme 06. Brain Stimulation |
Oral Session |
Chair: Johansson, Johannes | Linköping University |
Co-Chair: Connolly, Mark | Yerkes National Primate Research Center |
|
10:30-10:45, Paper WeBT12.1 | |
Cross-Modal Activation of the Primary Visual Cortex by Auditory Stimulation in RCS Rats: Considerations in Visual Prosthesis |
|
Caravaca-Rodriguez, Daniel | Universidad De Sevilla |
Suaning, Gregg | The University of Sydney |
Barriga-Rivera, Alejandro | Universidad De Sevilla |
Keywords: Sensory neuroprostheses, Sensory neuroprostheses - Visual, Sensory neuroprostheses - Auditory
Abstract: An important brain re-wiring, the so-called cross-modal plasticity, occurs during progression of retinal degenerative diseases to compensate for lack of visual input. The visual cortex does not go ‘unused’, instead it is devoted to processing other sensory modalities. In this study we recorded, in the visual cortex, visual- and auditory-evoked potentials in an anesthetized murine model of retinal degeneration. The latency to the first peak of the recorded local field potentials was used to assess the speed of the response. Visual responses occurred significantly faster in the control group. Conversely, auditory responses appeared significantly faster in animals with retinal degeneration. This suggests the compensatory neural rewiring is optimizing the performance of other sensory modalities, hearing in this case. This phenomenon may play an important role in visual neuro-rehabilitation. Whether or not it can promote or deter the interpretation of artificially encoded neural signals from a visual prosthesis remains to be studied.
|
|
10:45-11:00, Paper WeBT12.2 | |
Relative Comparison of Non-Invasive Brain Stimulation Methods for Modulating Deep Brain Targets |
|
Radyte, Emile | University of Oxford |
Wendt, Karen | University of Oxford |
Memarian Sorkhabi, Majid | University of Oxford |
O'Shea, Jacinta | University of Oxford |
Denison, Timothy | University of Oxford |
Keywords: Neural stimulation - Deep brain, Brain physiology and modeling, Neurological disorders - Psychiatric disorders
Abstract: This study models and investigates whether temporally interfering electric fields (TI EFs) could function as an effective non-invasive brain stimulation (NIBS) method for deep brain structure targeting in humans, relevant for psychiatric applications. Here, electric fields off- and on-target are modelled and compared with other common NIBS modalities (tACS, TMS). Additionally, local effects of the field strength are modelled on single-compartment neuronal models. While TI EFs are able to effectively reach deep brain targets, the ratio of off- to on-target stimulation remains high and comparable to other NIBS and may result in off-target neural blocks. Clinical Relevance— This study builds on earlier work and demonstrates some of the challenges –such as off-target conduction blocks– of applying TI EFs for targeting deep brain structures important in understanding the potential of treating neuropsychiatric conditions in the future.
|
|
11:00-11:15, Paper WeBT12.3 | |
DBSim and ELMA – Freeware for Simulations of Deep Brain Stimulation |
|
Johansson, Johannes | Linköping University |
Wardell, Karin | Linkoping University |
Keywords: Neural stimulation - Deep brain, Brain physiology and modeling, Neurological disorders
Abstract: Abstract— Finite Element Method (FEM) simulations of the electric field is a useful tool to estimate the activated tissue around Deep Brain Stimulation (DBS) electrodes. Based on our previous research, a two-part software package named DBSim and ELMA is presented. ELMA is used to classify brain tissue into grey matter, white matter, blood, and cerebrospinal fluid and assign electric conductivities accordingly. This data is then used in DBSim to generate patient-specific simulations of the electric field around currently implemented leads Medtronic 3387 and 3389, and Abbott 6180 and 6181. The software is available for free download at https://liu.se/en/article/ne-downloads Clinical Relevance— This is a tool meant for research and educational purposes for e.g., studies on optimal target areas for DBS.
|
|
11:15-11:30, Paper WeBT12.4 | |
Intended and Actual Orientations of Directional Deep Brain Stimulation Leads Vary across Deep Brain Stimulation Systems |
|
Henry, Kaylee | Northwestern University |
Miulli, Milina | Northwestern University |
Elahi, Behzad | Northwestern University, Department of Physical Therapy and Huma |
Rosenow, Joshua | Northwestern University |
Nolt, Mark J. | Northwestern University |
Rad, Laleh Golestani | Northwestern University |
Keywords: Neural stimulation - Deep brain, Neural stimulation
Abstract: Deep brain stimulation (DBS) offers therapeutic benefits to patients suffering from treatment-resistant movement and neurological disorders. The newest generation of DBS devices utilizes segmented electrodes to direct current asymmetrically to the surrounding neuronal tissue. Since segmented electrodes offer a larger number of contacts for individualized patient-specific programming, it is becoming more critical to ensure that the surgically intended and actual orientation of the lead match to avoid adverse side effects caused by stimulation of specific nuclei. Postoperative image analysis algorithms, such as DiODe in Lead-DBS, are commonly used to determine DBS leads' actual orientation. However, there has yet to be a comparison of the deviation between intended and actual orientations across the most commonly implanted directional DBS systems: Boston Scientific Cartesia™ and St. Jude Medical Infinity. This study is the first to statistically analyze the rotation of 126 leads from both DBS systems.
|
|
11:30-11:45, Paper WeBT12.5 | |
Meta-Bayesian Optimization for Deep Brain Stimulation |
|
Connolly, Mark | Yerkes National Primate Research Center |
Opri, Enrico | Emory University |
Miocinovic, Svjetlana | Emory University |
Devergnas, Annaelle | Yerkes National Primate Research Center |
Keywords: Neural stimulation, Neural stimulation - Deep brain, Smart neural implants - Neurostimulation
Abstract: Deep brain stimulation (DBS) is becoming a fundamental tool for the treatment and study of neurological and psychiatric diseases and disorders. Recently developed DBS devices and electrodes have allowed for more flexible and precise stimulation. Densely packed stimulation contacts can be independently stimulated to shape the electric field, targeting pathways of interest, and avoiding those that may cause side-effects. However, this flexibility comes at a cost. Each additional stimulation setting causes an exponential increase in the number of potential stimulation settings. Recent works have addressed this problem using Bayesian optimization. However, this approach has a limited ability to learn from multiple subjects to improve performance. In this study we extend a recently developed meta-Bayesian optimization algorithm to the DBS domain. We evaluated this approach compared to classical Bayesian optimization and a random search using data collected from a nonhuman primate during stimulation of the subthalamic nucleus while recording evoked potentials in the motor cortex and locally within the subthalamic nucleus. On the task of finding the stimulation setting that maximized the evoked potential across a distribution of generated objective functions, meta-Bayesian optimization significantly outperformed the other approaches with a cumulative reward of 8.93±0.70, compared to 7.17±1.64 for Bayesian optimization (p < 10-9) and 6.89±1.56 for the random search (p < 10-9). Moreover, the algorithm outperformed Bayesian optimization when tested on an objective function not used during training. These results demonstrate that meta-Bayesian optimization can take advantage of the structure underlying a distribution of objective function and learn an optimal search strategy that can generalize beyond the objective functions that were not part of the training data.
|
|
11:45-12:00, Paper WeBT12.6 | |
Automated Tuning of Closed-Loop Neuromodulation Control Systems Using Bayesian Optimization |
|
Sarikhani, Parisa | Emory University |
Hsu, Hao-Lun | Georgia Institute of Technology |
Mahmoudi, Babak | Emory University |
Keywords: Neural stimulation - Deep brain, Neurorehabilitation, Neural signals - Machine learning & Classification
Abstract: Tuning the parameters of controllers to attain the best performance is a challenging task in designing effective closed-loop neuromodulation systems. In this paper, we present a distributed architecture for automated tuning and adaptation of closed-loop neuromodulation control systems. We use this approach for the automated parameter tuning of a Proportional-Integral (PI) neuromodulation controller using Bayesian optimization. We use a biophysically-grounded mean-field model of neural populations under electrical stimulation as a simulation environment for testing and prototyping the proposed framework and characterizing its performance. Our results demonstrate the feasibility of using Bayesian optimization for performance-based automated tuning of a PI controller in closed-loop set-point neuromodulation control tasks.
|
|
WeCT4 |
Boisdale-2 |
Theme 01. Signal Processing and Classification of Cardiovascular Signals |
Oral Session |
Chair: Telfer, Brian | MIT Lincoln Laboratory |
|
14:00-14:15, Paper WeCT4.1 | |
A Comparison between Wavelet Scattering Transform and Transfer Learning for Elevated Blood Pressure Detection |
|
Martinez Rios, Erick Axel | Tecnologico De Monterrey |
Montesinos, Luis | Tecnologico De Monterrey |
Alfaro Ponce, Mariel | Tecnológico De Monterrey |
Keywords: Time-frequency and time-scale analysis - Wavelets, Signal pattern classification, Neural networks and support vector machines in biosignal processing and classification
Abstract: Hypertension is a health issue whose late diagnosis could lead to renal, cerebral, and cardiac events. In this work, it is proposed to use the wavelet scattering transform (WST) as a feature extraction technique applying classical machine learning techniques using photoplethysmography (PPG) signals as input to detect elevated blood pressure and compare its performance with transfer learning applied through fine-tuned convolutional neural networks. The results show that the features obtained by applying the WST and training a logistic regression and support vector machine produced similar results in terms of accuracy compared to fine-tuned convolutional neural networks, with the advantage that the WST could be used to generate a white-box model, which is better suited for a potential medical diagnosis application.
|
|
14:15-14:30, Paper WeCT4.2 | |
Intracranial Pressure Pulse Morphology-Based Definition of Life-Threatening Intracranial Hypertension Episodes |
|
Mataczyński, Cyprian | Wrocław University of Science and Technology |
Kazimierska, Agnieszka | Wrocław University of Science and Technology |
Uryga, Agnieszka | Wroclaw University of Science and Technology |
Kasprowicz, Magdalena | Wroclaw University of Technology |
Keywords: Data mining and big data methods - Patient outcome and risk analysis, Physiological systems modeling - Signal processing in physiological systems, Neural networks and support vector machines in biosignal processing and classification
Abstract: Intracranial hypertension (IH) is associated with poor outcome in traumatic brain injury (TBI) patients and must be avoided to prevent secondary brain injury. In clinical practice the most common method of IH detection is the calculation of the mean value of intracranial pressure (ICP) and the therapeutic intervention is usually introduced when the mean exceeds a certain threshold. This threshold, however, is rather individual for each patient than universal for all. It is well known that impaired cerebrovascular reactivity and reduced intracranial compliance are associated with raised ICP. This work explores a new definition of life-threatening hypertension (LTH) which accounts for the state of cerebral compliance. In the proposed method, changes in compliance are analysed through identification of likely pathological and/or pathological shapes of ICP pulse waveforms using a neural network. In terms of predictive power for mortality in TBI, detection of both shape clasess of ICP pulse waveforms during raised ICP offers similar results to previously proposed LTH definition accounting for the state of cerebrovascular reactivity (77.8% vs 76.9% accuracy, respectively). On the other hand, the fully pathological shapes of ICP pulses are present during ICP rises almost only in recordings of patients who died: out of 216 analysed patients only 6% of surviving and as many as 42% of deceased patients developed this type of LTH event. The stricter definition of LTH events including only pathological shape of ICP pulses presents the highest accuracy among the analysed approaches for mortality prediction (87.9%).
|
|
14:30-14:45, Paper WeCT4.3 | |
Feature Importance Analysis for Compensatory Reserve to Predict Hemorrhagic Shock |
|
Gupta, Jay | MIT Lincoln Laboratory |
Telfer, Brian | MIT Lincoln Laboratory |
Convertino, Victor | U.S. Army Institute of Surgical Research |
Keywords: Signal pattern classification, Physiological systems modeling - Signal processing in physiological systems
Abstract: Hemorrhage is the leading cause of preventable death from trauma. Traditionally, vital signs have been used to detect blood loss and possible hemorrhagic shock. However, vital signs are not sensitive for early detection because of physiological mechanisms that compensate for blood loss. As an alternative, machine learning algorithms that operate on a noninvasive arterial blood pressure (ABP) waveform acquired via photoplethysmography have been shown to provide an effective early indicator. However, these machine learning approaches lack physiological interpretability. In this paper, we evaluate the importance of nine ABP-derived features that provide physiological insight, using a database of 40 human subjects from a lower-body negative pressure model of progressive central hypovolemia. One feature was found to be considerably more important than any other. That feature, the half-rise to dicrotic notch (HRDN), measures an approximate time delay between the ABP ejected and reflected wave components. This delay is an indication of compensatory mechanisms such as reduced arterial compliance and vasoconstriction. For a scale of 0% to 100%, with 100% representing normovolemia and 0% representing decompensation, linear regression of the HRDN feature results in root-mean-squared error of 16.9%, R2 of 0.72, and an area under the receiver operating curve for detecting decompensation of 0.88. These results are comparable to previously reported results from the more complex black box machine learning models.
|
|
14:45-15:00, Paper WeCT4.4 | |
Multiscale Information Decomposition of Long Memory Processes: Application to Plateau Waves of Intracranial Pressure |
|
Pinto, Helder | Universidade Do Porto Fac Ciencias |
Dias, Celeste | Faculty of Medicine of the University of Porto |
Rocha, Ana Paula | Universidade Do Porto, Faculdade De Ciencias |
Keywords: Physiological systems modeling - Signal processing in physiological systems, Directionality, Multivariate methods
Abstract: Traumatic Brain Injury (TBI) patients present high levels of physical stress, which in some situations can manifest as Plateau Wave (PW) episodes. This intense stress phenomenon can be evidenced by Heart Rate Variability (HRV). Thus, the multivariate and simultaneous analysis of cardiocerebrovascular oscillations, involving the RR intervals, mean arterial pressure (MAP) and the amplitude of intracranial pressure (AMP), will be useful to understand the interconnections between body signals, allowing the interpretation of the combined activity of pathophysiological mechanisms. In this work, the multiscale representation of the Transfer Entropy (TE) and of its decomposition in the network of these three interacting processes is obtained, based on a Vector AutoRegressive Fractionally Integrated (VARFI) framework for Gaussian processes. This method allows to assess directed interactions and to quantify the information flow accounting for the simultaneous presence of short-term dynamics and long range correlations. The results show that the baseline RR, but not MAP can provide information about the possibility of a PW arising. During PW, the long-term correlations highlight synergistic interactions between MAP and AMP processes on RR. The multiscale decomposition of the information along with the incorporation of the long term correlations allowed a better description of HRV during PW, highlighting the fact that the HRV mirrors this cerebrovascular phenomena.
|
|
15:00-15:15, Paper WeCT4.5 | |
A Fast and Accurate Learning-Based Decoding Algorithm for the Classification of Cardiovascular and Respiratory Challenges Using Intraneural Electrodes in the Pig Vagus Nerve |
|
Pollina, Leonardo | The BioRobotics Institute and with the Department of Excellence |
Vallone, Fabio | The BioRobotics Institute and Department of Excellence in Roboti |
Ottaviani, Matteo | Scuola Superiore Sant'Anna |
Strauss, Ivo | Scuola Superiore Sant'Anna |
Recchia, Fabio | Instituto Di Scienze Della Vita |
Moccia, Sara | Scuola Superiore Sant'Anna |
Micera, Silvestro | Scuola Superiore Sant'Anna |
Keywords: Signal pattern classification, Neural networks and support vector machines in biosignal processing and classification, Data mining and big data methods - Machine learning and deep learning methods
Abstract: Bioelectronic medicine is a new approach for developing closed-loop neuromodulation protocols on the peripheral nervous system (PNS) to treat a wide range of disorders currently treated with pharmacological approaches. Algorithms need to have low computational cost in order to acquire, process and model data for the modulation of the PNS in real time. Here, we present a fast learning-based decoding algorithm for the classification of cardiovascular and respiratory functional alterations (i.e., challenges) by using neural signals recorded from intraneural electrodes implanted in the vagus nerve of 5 pigs. Our algorithm relies on 9 handcrafted features, extracted following signal temporal windowing, and a multi-layer perceptron (MLP) for feature classification. We achieved fast and accurate classification of the challenges, with a computational time for feature extraction and prediction lower than 1.5 ms. The MLP achieved a balanced accuracy higher than 80% for all recordings. Our algorithm could represent a step towards the development of a closed-loop system based on a single intraneural interface with both the potential of real time classification and selective modulation of the PNS.
|
|
15:15-15:30, Paper WeCT4.6 | |
Accurate Chronic Stress Estimation with Personalized Models Based on Correlation Maximization |
|
Tsujikawa, Masanori | NEC Corporation |
Kitade, Tasuku | NEC Corporation |
Suzuki, Keisuke | NEC Corporation |
Shibuya, Kei | NEC Corporation |
Keywords: Signal pattern classification, Physiological systems modeling - Signal processing in physiological systems, Adaptive filtering
Abstract: We propose an accurate chronic stress estimation system that utilizes personalized models based on correlation maximization between physiological features and ground truth, which helps determine physiological features effective for the estimation. The personalized models are trained using features respectively found for each individual classes among which the relationships between features and ground truth differ. Which class a new user belongs to can be estimated from the results of a personality questionnaire, as well as by means of conventional methods. W.r.t. evaluation data, with the cooperation of 168 subjects, 599 sets of 1-month wearable-sensor data and ground-truth Perceived Stress Scale (PSS) data were collected, along with the Big Five Personality Traits for each subject. In chronic stress estimation evaluations using this above data, we have confirmed that the proposed classification system achieved 69.1% estimation accuracy in terms of increase/decrease in PSS, as compared to 59.3% and 56.8% achieved, respectively, with two conventional methods, one employing no classification and the other employing k-means clustering.
|
|
WeCT5 |
Carron-1 |
Theme 07. Human Activity Recognition |
Oral Session |
Chair: Min, Cheol-Hong | University of St. Thomas |
Co-Chair: Vanrumste, Bart | Katholieke Universiteit Leuven |
|
14:00-14:15, Paper WeCT5.1 | |
Exploring Human Activity Recognition Using Feature Level Fusion of Inertial and Electromyography Data |
|
Celik, Yunus | Northumbria University |
Stuart, Samuel | Northumbria University |
Woo, Wai Lok | Northumbria University |
Pearson, Liam | Northumbria University |
Godfrey, Alan | Northumbria University |
Keywords: Wearable body sensor networks and telemetric systems, Wearable sensor systems - User centered design and applications, Health monitoring applications
Abstract: Wearables are objective tools for human activity recognition (HAR). Advances in wearables enable synchronized multi-sensing within a single device. This has resulted in studies investigating the use of single or multiple wearable sensor modalities for HAR. Some studies use inertial data, others use surface electromyography (sEMG) from multiple muscles and different post-processing approaches. Yet, questions remain about accuracies relating to e.g., multi-modal approaches, and sEMG post-processing. Here, we explored how inertial and sEMG could be efficiently combined with machine learning and used with post-processing methods for better HAR. This study aims recognition of four basic daily life activities; walking, standing, stair ascent and descent. Firstly, we created a new feature vector based on the domain knowledge gained from previous mobility studies. Then, a feature level data fusion approach was used to combine inertial and sEMG data. Finally, two supervised learning classifiers (Support Vector Machine, SVM, and the k-Nearest Neighbors, kNN) were tested with 5-fold cross-validation. Results show the use of inertial data with sEMG increased overall accuracy by 3.5% (SVM) and 6.3% (kNN). Extracting features from linear envelopes instead of bandpass filtered sEMG improves overall HAR accuracy in both classifiers.
|
|
14:15-14:30, Paper WeCT5.2 | |
A User-Centric Approach for Personalization Based on Human Activity Recognition |
|
Boucharas, Dimitrios G. | Biomedical Research Institute, FORTH, GR 45110, Ioannina, Greece |
Androutsos, Christos | Department of Biomedical Research, Institute of Molecular Biolog |
Tachos, Nikolaos | Unit of Medical Technology and Intelligent Information Systems, |
Tripoliti, Evanthia | University of Ioannina |
Manousos, Dimitris | ICS-FORTH |
Jensen, Peter Skov | Department of Ophthalmology, Aarhus University Hospital, DK-8200 |
Castillon Torre, Luis | Service of Ophthalmology, Hospital San Juan De Dios Del Aljarafe |
Tsiknakis, Manolis | ICS-FORTH |
Fotiadis, Dimitrios I. | University of Ioannina |
Keywords: Wearable sensor systems - User centered design and applications, Modeling and analysis
Abstract: The objective of this work focuses on multiple independent user profiles that capture behavioral, emotional, medical, and physical patterns in the working and living environment resulting in one general user profile. Depending on the user’s current activity (e.g. walking, eating, etc.), medical history, and other influential factors, the developed framework acts as a supplemental assistant to the user by providing not only the ability to enable supportive functionalities (e.g. image filtering, magnification, etc.) but also informative recommendations (e.g. diet, alcohol, etc.). The personalization of such a profile lies within the user’s past preferences using human activity recognition as a base, and it is achieved through a statistical model, the Bayesian belief network. Training and real-time methodological pipelines are introduced and validated. The employed deep learning techniques for identifying human activities are presented and validated in publicly available and in-house datasets. The overall accuracy of human activity recognition reaches up to 86.96%.
|
|
14:30-14:45, Paper WeCT5.3 | |
Subtle Motion Detection Using Wi-Fi for Hand Rest Tremor in Parkinson’s Disease |
|
Chen, Shih-Yuan | National Cheng Kung University |
Lin, Chi-Lun | National Cheng Kung Univeristy |
Keywords: IoT sensors for health monitoring, Health monitoring applications, Physiological monitoring - Novel methods
Abstract: Parkinson's disease (PD) affects 1% of the population over the age of 60, and its prevalence increases with age. The disease progresses over time, and the condition can vary significantly in a day, which makes it difficult for precise diagnosis and medication based on short clinical sessions. Therefore, home health monitoring can play an important role in improving the healthcare of the PD patients. In this study, we proposed a method to detect, classify, and quantify daily movements and motor symptoms of PD by using the wireless sensing technology. With the presence of human movements in a space with the Wi-Fi coverage, the channel state information (CSI) of the wireless signal was transformed into images. The images were used to train a deep learning model to distinguish between different daily movements and simulated tremor. The results showed that our method obtained 99.59% and 100% accuracy of recognizing the tremor with modified VGG19 and modified Resnet152, respectively. In addition, the tremor movement was then successfully segmented out and quantified for the frequency and duration.
|
|
14:45-15:00, Paper WeCT5.4 | |
Drinking Gesture Detection Using Wrist-Worn IMU Sensors with Multi-Stage Temporal Convolutional Network in Free-Living Environments |
|
Wang, Chunzhuo | KU Leuven |
T., Sunil Kumar | KU Leuven |
De Raedt, Walter | Imec |
Camps, Guido | Wageningen University and Research |
Hallez, Hans | KU Leuven |
Vanrumste, Bart | Katholieke Universiteit Leuven |
Keywords: Modeling and analysis, Novel methods
Abstract: Maintaining adequate hydration is important for health. Inadequate liquid intake can cause dehydration problems. Despite the increasing development of liquid intake monitoring, there are still open challenges in drinking detection under free-living conditions. This paper proposes an automatic liquid intake monitoring system comprised of wrist-worn Inertial Measurement Units (IMUs) to recognize drinking gesture in free-living environments. We build an end-to-end approach for drinking gesture detection by employing a novel multi-stage temporal convolutional network (MS-TCN). Two datasets are collected in this research, one contains 8.9 hours data from 13 participants in semi-controlled environments, the other one contains 45.2 hours data from 7 participants in free-living environments. The Leave-One-Subject-Out (LOSO) evaluation shows that this method achieves a segmental F1-score of 0.943 and 0.900 in the semi-controlled and free-living datasets, respectively. The results also indicate that our approach outperforms the convolutional neural network and long-short-term-memory network combined model (CNN-LSTM) on our datasets. The dataset used in this paper is available at https://github.com/Pituohai/drinking-gesture-dataset/.
|
|
15:00-15:15, Paper WeCT5.5 | |
Classification of Handwashing and Similar Activities |
|
Eggen, Trygve | University of St. Thomas |
Min, Cheol-Hong | University of St. Thomas |
Keywords: IoT sensors for health monitoring, Sensor systems and Instrumentation, Modeling and analysis
Abstract: In this paper, an ensemble gentle boost decision tree classification algorithm is trained to classify handwashing from similar activities such as applying lotion to hands. Data is collected using a 3-axis accelerometer and gyroscope worn on the wrist. First, the data collection procedure is described. Then, feature identification is discussed. Once the feature matrix was created, the MATLAB classification learner app was used to classify the data based on the identified features. The overall classification rate achieved was 91.6% using an optimized boosted ensemble classifier.
|
|
15:15-15:30, Paper WeCT5.6 | |
Analyzing Impact of Mouthpiece-Based Puff Topography Devices on Smoking Behavior Using Wearable Sensors |
|
Belsare, Prajakta | DARTMOUTH COLLEGE |
Senyürek, Volkan | The University of Alabama |
Imtiaz, Masudul Haider | University of Alabama |
Betts, Jennifer | University at Buffalo |
Dowd, Ashley | University at Buffalo, the State University of New York |
Motschman, Courtney | University of Missouri, Columbia |
Tiffany, Stephen | State University of New York at Buffalo |
Sazonov, Edward | University of Alabama |
Keywords: New sensing techniques, Health monitoring applications, Physiological monitoring - Modeling and analysis
Abstract: Detailed assessment of smoking topography (puffing and post-puffing metrics) can lead to a better understanding of factors that influence tobacco use. Research suggests that portable mouthpiece-based devices used for puff topography measurement may alter natural smoking behavior. This paper evaluated the impact of a portable puff topography device (CReSS Pocket) on puffing & post-puffing topography using a wearable system, the Personal Automatic Cigarette Tracker v2 (PACT 2.0) as a reference measurement. Data from 45 smokers who smoked one cigarette in the lab and an unrestricted number of cigarettes under free-living conditions over 4 consecutive days were used for analysis. PACT 2.0 was worn on all four days. A puff topography instrument (CReSS pocket) was used for cigarette smoking on two random days during the four days of study in the laboratory and free-living conditions. Smoke inhalations were automatically detected using PACT2.0 signals. Respiratory smoke exposure metrics (i.e., puff count, duration of cigarette, puff duration, inhale-exhale duration, inhale-exhale volume, volume over time, smoke hold duration, inter-puff interval) were computed for each puff/smoke inhalation. Analysis comparing respiratory smoke exposure metrics during CReSS days and days without CReSS revealed a significant difference in puff duration, inhale-exhale duration and volume, smoke hold duration, inter-puff interval, and volume over time. However, the number of cigarettes per day and number of puffs per cigarette were statistically the same irrespective of the use of the CReSS device. The results suggested that the use of mouthpiece-based puff topography devices may influence measures of smoking topography with corresponding changes in smoking behavior and smoke exposure.
|
|
WeCT6 |
Carron-2 |
Theme 08. Biomechatronics and Robotics |
Oral Session |
Co-Chair: Caruso, Marco | Politecnico Di Torino |
|
14:00-14:15, Paper WeCT6.1 | |
Classification of Human Balance Recovery Strategies through Kinematic Motor Synergy Analysis |
|
Shen, Keli | Tohoku University |
chemori, ahmed | LIRMM |
Hayashibe, Mitsuhiro | Tohoku University |
Keywords: Biomechanics and robotics - Clinical evaluation in rehabilitation and orthopedics, Mechanics of locomotion and balance, New technologies and methodologies in human movement analysis
Abstract: A key problem in human balance recovery lies in understanding the mechanism of balance behavior with redundant bio-mechanical motors. Motor synergy has been known as an efficient tool to analyze characteristics of motion behavior and reconstruct control command. In this paper, motor synergy analysis for different control strategies is proposed to analyze different balance motion coordination for various levels of pushing force, and understand the coordination of human multiple joints regarding balance recovery. The spatial synergy of specific joint angles for different pushing levels is computed with the principal component analysis (PCA) to evaluate the adaptive balance motion response patterns and illustrate the improvement of balance robustness through the transition of joint coordination. Therefore, coordination transitions over multiple joints in balance recovery movements were analyzed to better understand the mechanism of balance strategy generation in this study.
|
|
14:15-14:30, Paper WeCT6.2 | |
Formal Pump Heel Height Affects the External Force Exerted on the Foot During Normal Walking |
|
Kitagawa, Yuka | Chiba University |
Umeda, Maho | Chiba University Graduate School of Nursing |
Nakashima, Yukiko | Chiba University Graduate School of Nursing |
Kawano, Mio | Chiba University Graduate School of Nursing |
Amemiya, Ayumi | Chiba University |
Keywords: Mechanics of locomotion and balance, New technologies and methodologies in human movement analysis, Applied tissue and organ models and motion analysis
Abstract: Forefoot pain, hallux valgus, shoe sore, flat foot, and calluses are among the common foot problems encountered by high heel wearers. This study aimed to investigate the external forces associated with shoe sore and callus while wearing formal heel shoes. The external force on the 1st, 2nd, and 5th metatarsal heads and heel center was measured using the ShokacChip. Women were asked to wear pumps with four heel heights (10, 30, 55, and 80 mm) and walk 15 m twice. Thirty-five women were included. The data of two participants were excluded due to sensor fault. With higher heels, normal stress (pressure) was significantly stronger on the inside of the forefoot and significantly weaker on the outside. Shear stress did not always increase or decrease proportionally with respect to heel height. SPR-i of the forefoot associated with callus formation was minimal in the 30-mm heel.
|
|
14:30-14:45, Paper WeCT6.3 | |
Systematic Motion Integration with Multiple Depth Cameras Allowing Sensor Movement for Stable Skeleton Tracking |
|
Furuhata, Kazuki | Tohoku University |
Kutsuzawa, Kyo | Tohoku University |
Owaki, Dai | Tohoku University |
Hayashibe, Mitsuhiro | Tohoku University |
Keywords: New technologies and methodologies in human movement analysis, Rehabilitation robotics and biomechanics - Integrated diagnostic and therapeutic systems
Abstract: In recent years, markerless motion capture using a depth camera or RGB camera without any restriction on the subject has been attracting attention. Especially, depth cameras such as Kinect and RealSense allow instantaneous motion capture even at home outside lab environment, which is attractive for rehabilitation usage. However, single depth camera can capture steadily skeleton only when the subject stands facing to camera for the limited range, thus it is hard to apply to track skeletons while walking. Multiple depth cameras setting may allow to expand the range, but it can involve non-practical calibration process and can affect instantaneous capture advantage of depth camera. In this study, we propose a systematic method to integrate the motion information of skeletal models obtained from multiple depth cameras. The proposed method can perform a quick calibration using skeletal models instead of external reference objects, and estimate the spatial relationship of the sensors that allows the depth camera to move. The result demonstrates stable skeleton tracking free from occlusion problem keeping instantaneous capture capability of depth cameras.
|
|
14:45-15:00, Paper WeCT6.4 | |
An ISB-Consistent Denavit-Hartenberg Model of the Human Upper Limb for Joint Kinematics Optimization: Validation on Synthetic and Robot Data During a Typical Rehabilitation Gesture |
|
Caruso, Marco | Politecnico Di Torino |
Gastaldi, Laura | Politecnico Di Torino |
Pastorelli, Stefano | Politecnico Di Torino |
Cereatti, Andrea | Politecnico Di Torino |
Digo, Elisa | Politecnico Di Torino |
Keywords: Optimization in musculoskeletal biomechanics, Modeling and simulation in musculoskeletal biomechanics, Joint biomechanics
Abstract: Several biomedical contexts such as diagnosis, rehabilitation, and ergonomics require an accurate estimate of human upper limbs kinematics. Wearable inertial measurement units (IMUs) represent a suitable solution because of their unobtrusiveness, portability, and low-cost. However, the time-integration of the gyroscope angular velocity leads to an unbounded orientation drift affecting both angular and linear displacements over long observation interval. In this work, a Denavit-Hartenberg model of the upper limb was defined in accordance with the guidelines of the International Society of Biomechanics and exploited to design an optimization kinematics process. This procedure estimated the joint angles by minimizing the difference between the modelled and IMU-driven orientation of upper arm and forearm. In addition, reasonable constraints were added to limit the drift influence on the final joint kinematics accuracy. The validity of the procedure was tested on synthetic and experimental data acquired with a robotic arm over 20 minutes. Average rms errors amounted to 2.8 deg and 1.1 for synthetic and robot data, respectively.
|
|
15:00-15:15, Paper WeCT6.5 | |
Users Maintain Task Accuracy and Gait Characteristics During Missed Exoskeleton Actuations through Adaptations in Joint Kinematics |
|
Wu, Man I | University of Michigan |
Baum, Brian S. | MIT Lincoln Laboratory |
Edwards, Harvey | MIT |
Stirling, Leia | University of Michigan |
Keywords: Robotics - Orthotics and Exoskeletons, Wearable robotic systems - Orthotics and Exoskeletons, Exoskeleton applications
Abstract: In operational settings, lower-limb active exoskeletons may experience errors, where an actuation that should be present is missed. These missed actuations may impact users' trust in the system and the adapted human-exoskeleton coordination strategies. In this study, we introduced pseudo-random catch trials, in which an assistive exoskeleton torque was not applied, to understand the immediate responses to missed actuations and how users' internal models to an exoskeleton adapt upon repeated exposure to missed actuations. Participants (N = 15) were instructed to complete a stepping task while wearing a bilateral powered ankle exoskeleton. Human-exoskeleton coordination and trust were inferred from task performance (step accuracy), step characteristics (step length and width), and joint kinematics at selected peak locations of the lower limb. Step characteristics and task accuracy were not impacted by the loss of exoskeleton torque as hip flexion was modulated to support completing the stepping task during catch trials, which supports an impacted human-exoskeleton coordination. Reductions in ankle plantarflexion during catch trials suggest user adaptation to the exoskeleton. Trust was not impacted by catch trials, as there were no significant differences in task performance or gait characteristics between earlier and later strides. Understanding the interactions between human-exoskeleton coordination, task accuracy, and step characteristics will support development of exoskeleton controllers for non-ideal operational settings.
|
|
15:15-15:30, Paper WeCT6.6 | |
Developing an Upper Limb Kinematics Database of Activities of Daily Living |
|
Huang, Mia | San Francisco State University |
Freitas, Sandra | Universidade Cidade De São Paulo |
Bagesteiro, Leia B | San Francisco State University |
Keywords: Joint biomechanics, New technologies and methodologies in human movement analysis, Biomechanics and robotics - Clinical evaluation in rehabilitation and orthopedics
Abstract: Open-access databases can facilitate data sharing among researchers and provide normative data for objective clinical assessment development, robotic design, and biomechanical modeling. However, most existing databases focus on gait, balance, and hand gestures without providing elbow and shoulder kinematics that are required in activities of daily living. Furthermore, the few existing upper limb datasets include small sample sizes without consistent data collection protocols, which hinder robotic engineers’ ability to design robotic devices that accommodate the general population. To address the literature gap, an open-access upper limb kinematic database was proposed. Due to the impact of COVID-19 on human research, only data from 16 participants were collected.
|
|
WeCT9 |
Gala |
Theme 02. Image Analysis and Classification - Machine Learning / Deep
Learning Approaches - III |
Oral Session |
Chair: Suzuki, Kenji | Tokyo Institute of Technology |
Co-Chair: Lee, Ashe XinYee | Singapore Eye Research Institute |
|
14:00-14:15, Paper WeCT9.1 | |
Diverse Multi-Expert Network for Multi-Disease Detection in Retinal Fundus Images |
|
Lyu, Linquan | University of Missouri |
Toubal, Imad | University of Missouri |
Palaniappan, Kannappan | University of Missouri-Columbia |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches
Abstract: Automatic diagnosis of eye diseases from retinal fundus images is quite challenging. Common public datasets include images of subjects with multiple diseases with uneven distribution of labels. Rare diseases are especially challenging due to their under-representation in such datasets. In this paper, we propose a training pipeline for the multi-labeled classification with uneven distribution of the sample size and sample difficulty. First, we guide the training of the initial model by weighing the training loss using an inverse-frequency for each class. This will balance the training on over-represented and under-represented samples. We then adjust the class weights using the aggregated loss for each class, and train for more iterations. In this way, the model at each iteration will focus more on difficult samples and cover the shortcomings of the previous model. Finally, we ensemble together all the models and apply post-processing for improving multi-label predictions. Experiments on the Retinal Fundus Multi-Disease Image Dataset (RFMiD) prove the effectiveness of our pipeline. Furthermore, our method provides new ideas for the training of deep learning for the multi-label classification task and imbalanced datasets.
|
|
14:15-14:30, Paper WeCT9.2 | |
MEG-Based Classification and Grad-CAM Visualization for Major Depressive and Bipolar Disorders with Semi-CNN |
|
Huang, Chun-Chih | National Yang Ming Chiao Tung University |
Low, Intan | National Yang Ming Chiao Tung University |
Kao, Chia-Hsiang | National Yang Ming Chiao Tung University |
Yu, Chuan-Yu | National Yang Ming Chiao Tung University |
Su, Tung-Ping | Taipei Veterans General Hospital |
Hsieh, Jen-Chuen | Taipei Veterans General Hospital |
Chen, Yong-Sheng | National Yang Ming Chiao Tung University |
Chen, Li-Fen | National Yang Ming Chiao Tung University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Multivariate image analysis, MEG imaging
Abstract: Major depressive disorder (MDD) and bipolar disorder (BD) are two major mood disorders with partly overlapped symptoms but different treatments. However, their misdiagnosis and mistreatment are common based on the DSM-V criteria, lacking objective and quantitative indicators. This study aimed to develop a novel approach that accurately classifies MDD and BD based on their resting-state magnetoencephalography (MEG) signals during euthymic phases. A revisited 3D CNN model, Semi-CNN, that could automatically detect brainwave patterns in spatial, temporal, and frequency domains was implemented to classify wavelet-transformed MEG signals of normal controls and MDD and BD patients. The model achieved 96.05% testing accuracy and an average of 95.71% accuracy for 5-fold cross-validation. Furthermore, saliency maps of the model were estimated using Grad-CAM++ for visual explanation of the proposed classification model and highlighting the disease-specific brain regions and frequencies.
|
|
14:30-14:45, Paper WeCT9.3 | |
Extravasation Screening and Severity Prediction from Skin Lesion Image Using Deep Neural Networks |
|
Munthuli, Adirek | Thammasat University |
Intanai, Jace | Thammasat University |
Tossanuch, Pornnapat | Faculty of Engineering, Thammasat University |
Pooprasert, Pakinee | Cardiff University School of Medicine |
Ingpochai, Piyatouchporn | Abingdon School |
Boonyasatian, Sirirat | Faculty of Engineering, Thammasat University |
Kittithammo, Kittikhun | Faculty of Engineering, Thammasat University |
Thammarach, Purinat | Thammasat University |
Boonmak, Tanapol | Faculty of Engineering, Thammasat University |
Khaengthanyakan, Suntara | Thammasat University |
Yaemsuk, Akarachai | Thammasat University |
Vanichvarodom, Podsirin | Thammasat University |
Phienphanich, Phongphan | Thammasat University |
Pongcharoen, Padcha | Faculty of Medicine, Thammasat University |
Sakonlaya, Dussadee | Faculty of Medicine, Thammasat University |
Sitthiwatthanawong, Pradtana | Faculty of Medicine, Thammasat University |
Wetchawalit, Sinee | Faculty of Medicine, Thammasat University |
Chakkavittumrong, Panlop | Faculty of Medicine, Thammasat University |
Thongthawee, Borwarnluck | Faculty of Nursing, Thammasat University |
Pathomjaruwat, Thitiporn | Faculty of Nursing, Thammasat University |
Tantibundhit, Charturong | Thammasat University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification, Image segmentation
Abstract: Extravasation occurs secondary to the leakage of medication from blood vessels into the surrounding tissue during intravenous administration resulting in significant soft tissue injury and necrosis. If treatment is delayed, invasive management such as surgical debridement, skin grafting and even amputation may be required. Thus, it is imperative to develop a develop a smartphone application for predicting level of extravasation severity from skin image. Two Deep Neural Network (DNN) architectures, U-Net and DenseNet-121, were used to segment skin and lesion, and to classify extravasation severity. Sensitivity and specificity for predicting binary classification between asymptomatic and abnormal cases were 77.78 and 90.24%. For severe extravasation attained the highest accuracy of 90.83%, followed by mild extravasation of 85.32%, and moderate extravasation of 77.98%. The accuracy of moderate-to-severe extravasation classification can be improved by using an ensemble model for multi-class classification. These findings proposed a novel and feasible DNN approach for screening extravasation from skin images. The implementation of DNN-based applications on mobile devices has a strong potential for clinical application in low-resource countries.
|
|
14:45-15:00, Paper WeCT9.4 | |
PENet: Continuous-Valued Pulmonary Edema Severity Prediction on Chest X-Ray Using Siamese Convolutional Networks |
|
Akbar, Md Navid | Northeastern University |
Wang, Xin | Philips Research North America |
Erdogmus, Deniz | Northeastern University |
Dalal, Sandeep | Philips Research North America |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, X-ray radiography
Abstract: For physicians to make rapid clinical decisions for patients with congestive heart failure, the assessment of pulmonary edema severity in chest radiographs is vital. Although deep learning has shown promise in detecting the presence or absence or discrete grades of severity, of such edema, prediction of continuous-valued severity yet remains a challenge. Here, we propose PENet: Siamese convolutional neural networks to assess the continuous spectrum of severity of lung edema from chest radiographs. We present different modes of implementing this network and demonstrate that our best model outperforms that of earlier work (mean AUC of 0.91 over 0.87), while using only 1/16-th the dimension of input images and 1/69-th the size of training data, thus also saving expensive computation.
|
|
15:00-15:15, Paper WeCT9.5 | |
Automated Retinal Vascular Topological Information Extraction from OCTA |
|
Lee, Ashe XinYee | Singapore Eye Research Institute |
Saxena, Ashish | Singapore Eye Research Institute |
Chua, Jacqueline | Singapore Eye Research Institute |
Schmetterer, Leopold | Singapore Eye Research Institute |
Tan, Bingyao | Singapore Eye Research Institute |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image segmentation, Optical imaging - Coherence tomography
Abstract: The retinal vascular system adapts and reacts rapidly to ocular diseases, such as glaucoma, diabetic retinopa- thy and age-related macular degeneration. Here we present a combination of methods to further extract vascular information from 12x12mm wide-field optical coherence tomography angiog- raphy (OCTA). An integrated U-Net for the segmentation and classification of arteries and veins reached a segmentation IoU of 0.7095 0.0224, and classification IoU of 0.8793 0.1049 and 0.8928 0.0929 respectively. A correcting algorithm which uses topological information was created to correct the misclassification and connectivity of the vessels, which showed an average increase of 8.29% in IoU. Finally, the vessel morphometry of branch orders was extracted, where this allows the direct comparison of artery/vein, arterioles/venules and capillaries.
|
|
15:15-15:30, Paper WeCT9.6 | |
Encoding Deep Residual Features into Fisher Vector for Skin Lesion Classification |
|
胡, 航语 | NWPU |
Chen, Ziyang | Northwestern Polytechnical University |
Xia, Yong | Northwestern Polytechnical University |
Keywords: Image classification, Image analysis and classification - Machine learning / Deep learning approaches, Image feature extraction
Abstract: Computer-aided skin lesion classification using dermoscopy is essential for early detection of melanoma, which is the most effective means to reduce the mortality rate. Although many deep learning models have been designed for this task, skin lesion classification remains challenging due to the small sample size, inter-class similarity, intra-class inconsistency, and class imbalance. In this paper, we propose a hybrid deep residual network and Fisher vector (ResNet-FV) algorithm for skin lesion classification, aiming to boost the performances of ResNet using the Fisher vector encoding scheme. The proposed algorithm has been evaluated on the 2018 Skin Lesion Analysis Towards Melanoma Detection Challenge (ISIC-skin 2018) dataset and achieved a balanced multi-class accuracy of 0.798, outperforming several existing solutions.
|
|
WeCT10 |
Forth |
Theme 02. Magnetic Resonance Imaging - Neuroimaging |
Oral Session |
Chair: Ding, Lei | University of Oklahoma |
Co-Chair: Chan, Russell | Gense Technologies Ltd |
|
14:00-14:15, Paper WeCT10.1 | |
NOise Reduction with DIstribution Corrected (NORDIC) PCA Improves Signal-To-Noise in Rodent Resting-State and Optogenetic Functional MRI |
|
Chan, Russell | NYU School of Medicine |
Lee, Royce | NYU Grossman School of Medicine |
Wu, Sarah | NYU Grossman School of Medicine |
Tse, Emily | New York University Grossman School of Medicine |
Xue, Yixi | New York University School of Medicine |
Moeller, Steen | University of Minnesota |
Chan, Kevin C. | New York University |
Keywords: Magnetic resonance imaging - MR neuroimaging, Image enhancement - Denoising, Brain imaging and image analysis
Abstract: NOise Reduction with DIstribution Corrected (NORDIC) principal component analysis (PCA) has been shown to selectively suppress thermal noise and improve temporal signal-to-noise ratio (tSNR) in human functional magnetic resonance imaging (fMRI). However, the feasibility to improve rodent fMRI using NORDIC PCA has not been explored. In this study, we developed a rodent fMRI preprocessing pipeline by incorporating NORDIC and evaluated its performance in a range of rodent fMRI applications from resting-state fMRI to task-evoked fMRI using optogenetics. In resting-state fMRI, we demonstrated a significant increase in tSNR by more than 3 times after NORDIC correction with reduced variance and improved task-free relative cerebrovascular reactivity across cortical depth. In optogenetic fMRI, apart from tSNR increase, more activated voxels and a significant decrease in the variance of activated brain signals were observed after NORDIC correction without apparent change in brain morphology. Taken together, our results signified the values of NORDIC correction for better detection of brain activities in rodent fMRI.
|
|
14:15-14:30, Paper WeCT10.2 | |
Improving Autism Spectrum Disorder Prediction by Fusion of Multiple Measures of Resting-State Functional MRI Data |
|
Liang, Lingyan | Inspur |
Dong, Gang | Inspur |
Li, Changsheng | Beijing Institute of Technology |
Wen, Dongchao | Inspur |
Zhao, Yaqian | Inspur |
Li, Jing | Institute of Psychology, CAS |
Keywords: Brain imaging and image analysis, Magnetic resonance imaging - MR neuroimaging
Abstract: Autism spectrum disorder (ASD) is a lifelong neurodevelopmental condition characterized by social communication, language and behavior impairments. Leveraging deep learning to automatically predict ASD has attracted more and more attention in the medical and machine learning communities. However, how to select effective measure signals for deep learning prediction is still a challenging problem. In this paper, we studied two kinds of measure signals, i.e., regional homogeneity (ReHo) and Craddock 200 (CC200), which both represents homogeneous functional activity, in the framework of deep learning, and designed a new mechanism to effectively joint them for deep learning based ASD prediction. Extensive experiments on the ABIDE dataset provide empirical evidence in support of effectiveness of our method. In particular, we obtained 79% in terms of accuracy by effectively fusing these two kinds of signals, much better than any single-measure model (ReHo SM-model: ~69% and CC200 SM-model: ~70%). These results suggest that leveraging multi-measure signals together are effective for ASD prediction.
|
|
14:30-14:45, Paper WeCT10.3 | |
Discovery and Replication of Time-Resolved Functional Network Connectivity Differences in Adolescence and Adulthood in Over 50K fMRI Datasets |
|
Abrol, Anees | Georgia State University, the Mind Research Network |
Calhoun, Vince D | Tri-Institutional Center for Translational Research in Neuroimag |
Keywords: Magnetic resonance imaging - MR neuroimaging, Machine learning / Deep learning approaches, Brain imaging and image analysis
Abstract: There remains an open question about whether and in what context brain function varies in adolescence and adulthood. In this work, we systematically study the functional brain networks of adolescents and adults, outlining the significant differences in the developing brain detected via time-resolved functional network connectivity (trFNC) derived from a fully automated independent component analysis pipeline applied to resting-state fMRI data in over 50K individuals. We then statistically analyze the transient, recurrent, and robust brain state profiles in both groups. We confirmed the results in independent replication datasets for both groups. Our findings indicate a strengthening of a state reflecting functional coupling within the visual, motor, and auditory domains and anticorrelation with all other domains in a unique adult state profile, a pattern consistently less modular in adolescents. This new insight into possible integration, strengthening, and modularization of resting-state brain connections beyond childhood convergently indicates that the highlighted temporal dynamics likely reflect robust differences in brain function in adolescents versus adults.
|
|
14:45-15:00, Paper WeCT10.4 | |
‘Harmless’ Adversarial Network Harmonization Approach for Removing Site Effects and Improving Reproducibility in Neuroimaging Studies |
|
Yan, Weizheng | Georgia State University |
Sui, Jing | Beijing Normal University |
Fu, Zening | Georgia State University |
Calhoun, Vince D | Tri-Institutional Center for Translational Research in Neuroimag |
Keywords: Magnetic resonance imaging - MR neuroimaging, Machine learning / Deep learning approaches, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: Multi-site collaboration, which gathers together samples from multiple sites, is a powerful way to overcome the small-sample problem in the neuroimaging field and has the potential to discover more robust and reproducible biomarkers. However, confounds among the datasets caused by various site-specific factors may dramatically reduce the cross-site reproducibility performance. To properly remove confounds while improving cross-site task performances, we propose a maximum classifier discrepancy generative adversarial network (MCD-GAN) that combines the advantages of generative models and maximum discrepancy theory. The mechanisms of MCD-GAN and how it harmonizes the dataset are visualized using simulated data. The performance of MCD-GAN was also compared with state-of-the-art methods (e.g., ComBat, cycle-GAN) within Adolescent Brain Cognitive Development (ABCD) dataset. Result demonstrates that the proposed MCD-GAN can effectively improve the cross-site gender classification performance by harmonizing site effects. Our proposed framework is also suitable for various classification/prediction tasks and is promising to facilitate the cross-site reproducibility of neuroimaging studies.
|
|
15:00-15:15, Paper WeCT10.5 | |
True Location of Deep Brain Stimulation Electrodes Differs from What Is Seen on Postoperative Magnetic Resonance Images: An Anthropomorphic Phantom Study |
|
Nuzov, Noa | Northwestern University |
Bhusal, Bhumi | Northwestern University |
Henry, Kaylee | Northwestern University |
Jiang, Fuchang | Northwestern University |
Rosenow, Joshua | Northwestern University |
Elahi, Behzad | Northwestern University, Department of Physical Therapy and Huma |
Rad, Laleh Golestani | Northwestern University |
Keywords: Magnetic resonance imaging - MR neuroimaging, Image segmentation, Brain imaging and image analysis
Abstract: Deep brain stimulation (DBS) is an established yet growing treatment for a range of neurological and psychiatric disorders. Over the last decade, numerous studies have underscored the effect of electrode placement on the clinical outcome of DBS. As a result, imaging is now extensively used for DBS electrode localization, even though the accuracy of different modalities in determining the true coordinates of DBS electrodes is less explored. Postoperative magnetic resonance imaging (MRI) is the gold standard method for DBS electrode localization, however, the geometrical distortion induced by the lead's artifact could limit the accuracy. In this work, we investigated to what degree the difference between the true location of the lead's tip and the location of the tip estimated from the MRI artifact varies depending on the MRI sequence parameters, acquisition plane, phase encoding direction, and the implant's extracranial trajectory.
|
|
15:15-15:30, Paper WeCT10.6 | |
Spatially Constrained ICA Enables Robust Detection of Schizophrenia from Very Short Resting-State FMRI |
|
Duda, Marlena | Georgia State University |
Iraji, Armin | Georgia State University |
Calhoun, Vince D | Tri-Institutional Center for Translational Research in Neuroimag |
Keywords: Magnetic resonance imaging - MR neuroimaging, Brain imaging and image analysis, Functional image analysis
Abstract: Resting-state functional network connectivity (rsFNC) has shown utility for identifying characteristic functional brain patterns in individuals with psychiatric and mood disorders, providing a promising avenue for biomarker development. However, several factors have precluded widespread clinical adoption of rsFNC diagnostics, namely the lack of standardized approaches for capturing comparable and reproducible imaging markers across individuals, as well as the disagreement on the amount of data required to robustly detect intrinsic connectivity networks (ICNs) and diagnostically relevant patterns of rsFNC. Here, we investigate the robustness of (1) subject-specific ICNs standardized to an a priori network template via spatially constrained ICA (scICA), and (2) rsFNC differences between schizophrenia and control groups with respect to the length of the fMRI. Our results suggest clinical rsFMRI scans, when decomposed with scICA, could potentially be shortened to just 2-4 minutes without significant loss of individual rsFNC information or classification performance of longer scan lengths.
|
|
WeCT13 |
Hall 1 |
Theme 02. Optical Imaging |
Oral Session |
Co-Chair: Wong, Eddie C. | Gense Technologies Ltd |
|
14:00-14:15, Paper WeCT13.1 | |
Generation of Synthetic Data for the Comparison of Different 3D-3D Registration Approaches in Laparoscopic Surgery |
|
Krames, Lorena | Karlsruher Institute of Technology (KIT) |
Suppa, Per | Olympus Surgical Technologies Europe |
Nahm, Werner | Karlsruhe Institute of Technology |
Keywords: Optical imaging, Multimodal image fusion, Image registration, segmentation, compression and visualization - Volume rendering
Abstract: In laparoscopic surgery image-guided navigation systems could support the surgeon by providing subsurface information such as the positions of tumors and vessels. For this purpose, one option is to perform a reliable registration of preoperative 3D data and a surface patch from laparoscopic video data. A robust and automatic 3D-3D registration pipeline for the application during laparoscopic surgery has not yet been found due to application-specific challenges. To gain a better insight, we propose a framework enabling a qualitative and quantitative comparison of different registration approaches. The introduced framework is able to evaluate 3D feature descriptors and registration algorithms by generating and modifying synthetic data from clinical examples. Different confounding factors are considered and thus the reality can be reflected in any simplified or more complex way. Two exemplary experiments with a liver model, using the RANSAC algorithm, showed an increasing registration error for a decreasing size of the surface patch size and after introducing modifications. Moreover, the registration accuracy was dependent on the position and structure of the surface patch. The framework helps to quantitatively assess and optimize the registration pipeline, and hereby suggests future software improvements even with only few clinical examples.
|
|
14:15-14:30, Paper WeCT13.2 | |
Keratoconus Classifier for Smartphone-Based Corneal Topographer |
|
Gairola, Siddhartha | Microsoft Research India |
Joshi, Pallavi | Sankara Eye |
Balasubramaniam, Anand | Sankara Eye |
MURALI, KAUSHIK | Sankara Eye Foundation India |
Kwatra, Nipun | Microsoft Research India |
Jain, Mohit | Microsoft Research India |
Keywords: Optical imaging, Machine learning / Deep learning approaches, Image classification
Abstract: Keratoconus is a severe eye disease that leads to deformation of the cornea. It impacts people aged 10-25 years and is the leading cause of blindness in that demography. Corneal topography is the gold standard for keratoconus diagnosis. It is a non-invasive process performed using expensive and bulky medical devices called corneal topographers. This makes it inaccessible to large populations, especially in the Global South. Low-cost smartphone-based corneal topographers, such as SmartKC, have been proposed to make keratoconus diagnosis accessible. Similar to medical-grade topographers, SmartKC outputs curvature heatmaps and quantitative metrics that need to be evaluated by doctors for keratoconus diagnosis. An automatic scheme for evaluation of these heatmaps and quantitative values can play a crucial role in screening keratoconus in areas where doctors are not available. In this work, we propose a dual-head convolutional neural network (CNN) for classifying keratoconus on the heatmaps generated by SmartKC. Since SmartKC is a new device and only had a small dataset (114 samples), we developed a 2-stage transfer learning strategy—using historical data collected from a medical-grade topographer and a subset of SmartKC data—to satisfactorily train our network. This, combined with our domain-specific data augmentations, achieved a sensitivity of 91.3% and a specificity of 94.2%.
|
|
14:30-14:45, Paper WeCT13.3 | |
Age-Related Changes in Dynamic Corneal Backscatter Observed in Scheimpflug Imaging |
|
Miazdzyk, Maria Magdalena | Wrocław University of Science and Technology |
Iskander, D Robert | Wroclaw University of Technology |
Keywords: Optical imaging
Abstract: Abstract: Corneal visualization Scheimpflug technology (Corvis ST) has the potential to indirectly provide information on corneal structure via the analysis of statistical properties of the backscatter. The aim of this work was to ascertain whether there are age-related changes in the dynamics of corneal backscatter during an air-puffed induced corneal deformation. Retrospective data from Corvis ST measurements of 151 young subjects (19–30 years) and 82 older subjects (50–87 years) were considered. Each measurement consisted of 140 frames (sampling frequency: 4330 fps). For every frame the cornea was first segmented, then regions of interest, encompassing temporal, central and nasal parts of cornea were selected, to which the parameters of Weibull distribution (scale and shape) were fitted, leading to time series of the estimated parameters. Apparent differences were found between the parameters of Weibull distribution between the two considered groups that manifest themselves mostly in the nasal region of the cornea. However, those differences cannot be attributed to the age alone. For this, a normalization method is proposed that leads to a much better separation between the groups in all considered regions. Clinical relevance: The parameters of the corneal backscatter are widely used to asses corneal clarity (so-called corneal densitometry). Recently, the parameters of Weibull distribution, fitted to the corneal backscatter data, have been used to support diagnosis of keratoconus. This work contributes to the assessment of corneal clarity by identifying the apparent age-related differences in those parameters when dynamic raw data is considered, highlighting the need for such parameters to be appropriately normalized. Further, it shown that the shape parameter of Weibull distribution, unlike the scale parameter, carries that information already for the raw, non-normalized data.
|
|
14:45-15:00, Paper WeCT13.4 | |
Hardware Inspired Neural Network for Efficient Time-Resolved Biomedical Imaging |
|
Zang, Zhenya | University of Strathclyde |
Xiao, Dong | University of Strathclyde |
Wang, Quan | University of Strathclyde |
Jiao, Ziao | University of Strathclyde |
Li, Zinuo | University of Strathclyde |
Chen, Yu | Strathclyde University |
Li, David | University of Strathclyde |
Keywords: Optical imaging and microscopy - Fluorescence microscopy, Image reconstruction and enhancement - Machine learning / Deep learning approaches
Abstract: Convolutional neural networks (CNN) have revealed exceptional performance for fluorescence lifetime imaging (FLIM). However, redundant parameters and complicated topologies make it challenging to implement such networks on embedded hardware to achieve real-time processing. We report a lightweight, quantized neural architecture that can offer fast FLIM imaging. The forward-propagation is significantly simplified by replacing matrix multiplications in each convolution layer with additions and data quantization using a low bit-width. We first used synthetic 3-D lifetime data with given lifetime ranges and photon counts to assure correct average lifetimes can be obtained. Afterwards, human prostatic cancer cells incubated with gold nanoprobes were utilized to validate the feasibility of the network for real-world data. The quantized network yielded a 37.8% compression ratio without performance degradation.
|
|
15:00-15:15, Paper WeCT13.5 | |
Smart Wide-Field Fluorescence Lifetime Imaging System with CMOS Single-Photon Avalanche Diode Arrays |
|
Xiao, Dong | University of Strathclyde |
Zang, Zhenya | University of Strathclyde |
Wang, Quan | University of Strathclyde |
Jiao, Ziao | University of Strathclyde |
MATTIOLI DELLA ROCCA, Francesco | University of Edinburgh |
Chen, Yu | Strathclyde University |
Li, David | University of Strathclyde |
Keywords: Optical imaging and microscopy - Fluorescence microscopy, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: Wide-field fluorescence lifetime imaging (FLIM) is a promising technique for biomedical and clinic applications. Integrating with CMOS single-photon avalanche diode (SPAD) sensor arrays can lead to cheaper and portable real-time FLIM systems. However, the FLIM data obtained by such sensor systems often have sophisticated noise features. There is still a lack of fast tools to efficiently recover lifetime parameters from highly noise-corrupted fluorescence signals. This paper proposes a smart wide-field FLIM system containing a 192 × 128 COMS SPAD sensor, and a field-programmable gate array (FPGA) embedded deep learning (DL) processor. The processor adopts a hardware-friendly and light-weighted neural network for fluorescence lifetime analysis, showing the advantages of high accuracy against noise, fast speed, and low power consumption. Experimental results demonstrate the proposed system's superior and robust performances, promising for many FLIM applications such as FLIM-guided clinical surgeries, cancer diagnosis, and biomedical imaging.
|
|
15:15-15:30, Paper WeCT13.6 | |
Deep Learning for Breast Cancer Classification of Deep Ultraviolet Fluorescence Images Toward Intra-Operative Margin Assessment |
|
To, Tyrell | Marquette University |
Heidari Gheshlaghi, Saba | Marquette University |
Ye, Dong Hye | Marquette University |
Keywords: Optical imaging and microscopy - Fluorescence microscopy, Image analysis and classification - Digital Pathology, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: Breast conserving surgery aims at the complete removal of malignant lesions while minimizing healthy tissue loss. To ensure the balance between complete resection of the cancer and conservation of healthy tissue, intra-operative margin assessment is necessary. Deep ultra violet (DUV) fluorescence scanning microscope provides fast whole-surface-imaging (WSI) of excised tissue with contrast between malignant and normal tissues. Then, an automated breast cancer classification method on DUV images is required for intra-operative margin assessment. Deep learning shows the promising results in breast cancer classification, but limited DUV image dataset poses overfitting challenge to train the robust network. To tackle this challenge, we partition the DUV WSI image into small patches and extract pathological features for each patch from a pre-trained network using a transfer learning approach. We feed pathological features into a decision-tree-based classifier and fuse patch-level classification results based on regional importance to determine malignant or benign WSI. Experimental results on 60 DUV images show that our proposed method outperforms the standard deep learning classification in terms of improving the classification performance and identifying cancerous regions.
|
|
WeEP |
Hall 5 |
E-Poster Session II - July 13, 2022 |
Poster Session |
|
15:45-17:30, Subsession WeEP-01, Hall 5 | |
Theme 01. Deep Learning Methods for Cardiovascular Signals Poster Session, 6 papers |
|
15:45-17:30, Subsession WeEP-02, Hall 5 | |
Theme 01. Signal Processing & Classification of Cardiovascular Signals Poster Session, 11 papers |
|
15:45-17:30, Subsession WeEP-03, Hall 5 | |
Theme 01. Signal Processing & Classification of Speech and Acoustic Signals Poster Session, 7 papers |
|
15:45-17:30, Subsession WeEP-04, Hall 5 | |
Theme 01. Signal Processing and Classification of Cardiac Signals Poster Session, 6 papers |
|
15:45-17:30, Subsession WeEP-05, Hall 5 | |
Theme 02. Image Analysis and Classification - Machine Learning / Deep Learning Approaches - P2 Poster Session, 11 papers |
|
15:45-17:30, Subsession WeEP-06, Hall 5 | |
Theme 02. Image Reconstruction & Enhancement Poster Session, 9 papers |
|
15:45-17:30, Subsession WeEP-07, Hall 5 | |
Theme 02. Image Segmentation - P1 Poster Session, 9 papers |
|
15:45-17:30, Subsession WeEP-08, Hall 5 | |
Theme 02. Machine Learning/Deep Learning Applications - P1 Poster Session, 11 papers |
|
15:45-17:30, Subsession WeEP-09, Hall 5 | |
Theme 02. Other Imaging Applications - P2 Poster Session, 9 papers |
|
15:45-17:30, Subsession WeEP-10, Hall 5 | |
Theme 04. Systems Modeling: Fundamentals and Applications Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-11, Hall 5 | |
Theme 05. Cardiovascular Disease Poster Session, 8 papers |
|
15:45-17:30, Subsession WeEP-12, Hall 5 | |
Theme 06. Machine Learning, Brain Signal Processing for Neurorehabilitation & Neural Engineering II Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-13, Hall 5 | |
Theme 06. Stimulation of Neural Tissues Poster Session, 12 papers |
|
15:45-17:30, Subsession WeEP-14, Hall 5 | |
Theme 07. Human Movement Sensing P1 Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-15, Hall 5 | |
Theme 07. Mobile Sensors and Systems P1 Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-16, Hall 5 | |
Theme 07. Physiological and Biological Sensing P2 Poster Session, 9 papers |
|
15:45-17:30, Subsession WeEP-17, Hall 5 | |
Theme 08. Biorobotics and Biomechanics P1 Poster Session, 9 papers |
|
15:45-17:30, Subsession WeEP-18, Hall 5 | |
Theme 09. Diagnostic and Therapeutic Devices Poster Session, 7 papers |
|
15:45-17:30, Subsession WeEP-19, Hall 5 | |
Theme 09. Rehabilitation Poster Session, 3 papers |
|
15:45-17:30, Subsession WeEP-20, Hall 5 | |
Theme 10. General and Theoretical Informatics P3 Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-21, Hall 5 | |
Theme 10. Health Informatics P2 Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-22, Hall 5 | |
Theme 10. Imaging and Bioinformatics Poster Session, 7 papers |
|
15:45-17:30, Subsession WeEP-23, Hall 5 | |
Theme 01. Biomedical Signal Processing II Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-24, Hall 5 | |
Theme 01. Biomedical Signal Processing III Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-25, Hall 5 | |
Theme 02. Biomedical Imaging and Image Processing II Poster Session, 7 papers |
|
15:45-17:30, Subsession WeEP-26, Hall 5 | |
Theme 02. Biomedical Imaging and Image Processing V Poster Session, 5 papers |
|
15:45-17:30, Subsession WeEP-27, Hall 5 | |
Theme 02. Biomedical Imaging and Image Processing VI Poster Session, 8 papers |
|
15:45-17:30, Subsession WeEP-28, Hall 5 | |
Theme 03. Micro/Nano-Bioengineering Cellular/Tissue Engineering & Biomaterials Poster Session, 7 papers |
|
15:45-17:30, Subsession WeEP-29, Hall 5 | |
Theme 04. Computational Systems, Modeling and Simulation in Medicine, Multiscale Modeling & Synthetic Biology II Poster Session, 7 papers |
|
15:45-17:30, Subsession WeEP-30, Hall 5 | |
Theme 05. Cardiovascular and Respiratory Systems Engineering II Poster Session, 9 papers |
|
15:45-17:30, Subsession WeEP-31, Hall 5 | |
Theme 06. Neural and Rehabilitation Engineering II Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-32, Hall 5 | |
Theme 06. Neural and Rehabilitation Engineering VII Poster Session, 10 papers |
|
15:45-17:30, Subsession WeEP-33, Hall 5 | |
Theme 07. Biomedical Sensors and Wearable Systems I Poster Session, 8 papers |
|
15:45-17:30, Subsession WeEP-34, Hall 5 | |
Theme 07. Biomedical Sensors and Wearable Systems IV Poster Session, 11 papers |
|
15:45-17:30, Subsession WeEP-35, Hall 5 | |
Theme 10. Biomedical & Health Informatics II Poster Session, 8 papers |
|
15:45-17:30, Subsession WeEP-36, Hall 5 | |
Theme 10. Biomedical & Health Informatics IV Poster Session, 8 papers |
|
15:45-17:30, Subsession WeEP-37, Hall 5 | |
Theme 12. Translational Engineering at the Point of Care II Poster Session, 6 papers |
|
15:45-17:30, Subsession WeEP-38, Hall 5 | |
Theme 06. Neural and Rehabilitation Engineering IV Poster Session, 10 papers |
|
WeEP-01 |
Hall 5 |
Theme 01. Deep Learning Methods for Cardiovascular Signals |
Poster Session |
|
15:45-17:30, Paper WeEP-01.1 | |
Emotion Recognition Based on Energy-Related Features of Peripheral Physiological Signals |
|
Zhu, Zhibin | Zhejiang University |
Feng, Jingwen | Zhejiang University |
Wang, Xuanyi | Zhejiang University |
Xu, Yifei | ZheJiang University |
Zhou, Huiling | Zhejiang University |
Sun, Jingjing | Royal Conservatoire of Scotland |
Jiang, Wenchen | Zhejiang University |
Chen, Hang | Zhejiang University |
Keywords: Data mining and big data methods - Biosignal classification, Time-frequency and time-scale analysis - Time-frequency analysis, Data mining and big data methods - Machine learning and deep learning methods
Abstract: The interest in development of methods and tools for recognizing human emotions has increased continuously. Using physiological information, especially the peripheral physiological signals, to identify emotions is an important direction for this area. This paper proposes an approach for emotion recognition based on energy-related features extracted from peripheral physiological signals. Three emotions: calm, happiness and fear, were elicited in 54 volunteers using video clips while three peripheral physiological signals were recorded: Electrocardiography (ECG), Photoplethysmography (PPG) and Respiration. Given that energy-related features of physiological signals are closely related to autonomic nervous systems activities, nine energy-related features were extracted from the recorded physiological signals. To find the optimal feature subset to represent the target emotions, the correlation between features and emotion state, as well as the discrimination ability of feature for emotion recognition were both analyzed. Four optimal features were then selected for further classification. Moreover, models based on Decision Tree (DT) were built to evaluate the performance of these features for purpose of recognition of emotion states of calm, happiness, and fear. The results show that the DT models based on these four optimal features could distinguish fear from calm (AUC=0.879, Accuracy=87.8%), happiness from calm (AUC=0.915, Accuracy=91.8%), and fear from happiness (AUC=0.822, Accuracy=81.8%), with a global recognition accuracy of 70.8%. These results indicate that energy-related features of peripheral physiological signals can reliably identify emotions, especially intense emotions.
|
|
15:45-17:30, Paper WeEP-01.2 | |
Muscle Artifact Removal in Single-Channel Electrocardiograms Using Temporal Convolutional Networks |
|
Boudnik, Lukas | Fraunhofer IMTE |
Graßhoff, Jan | Fraunhofer IMTE |
Vollmer, Felix | Fraunhofer IMTE |
Rostalski, Philipp | Institute for Electrical Engineering in Medicine, University Of |
Keywords: Neural networks and support vector machines in biosignal processing and classification
Abstract: The electrocardiogram (ECG) is a vital diagnostic tool used in many health applications. In practice, interference by muscle artifacts is very common and may significantly complicate interpretation of the ECG waveform. In this work we investigate the removal of muscle artifacts in single-channel ECG signals using neural network models. To this end, we compare two neural network architectures which were previously used for ECG denoising and propose a novel third method based on the ConvTasNet. The neural networks are trained on simulated data using artificial mixtures of single-channel ECG (lead II) and surface EMG signals taken from publicly available datasets. ECG data were sampled from the MIT-BIH Arrhythmia and the PTB-XL database. The former provides recordings of ambulatory ECGs while the latter contains a large variety of cardiac pathologies recorded in a clinical setting. The muscle artifacts were sampled from the MIT-BIH Noise Stress Test database and the TaiChi database. In the past, most denoising methods were only tested on the two smaller MIT-BIH datasets. In this work, we report performances on larger datasets and thus provide stronger evidence for a clinical use-case. We also report out-of distribution performance of the three methods by switching the ECG dataset between training and test. The herein investigated variant of the ConvTasNet substantially reduces interference by muscle artifacts, outperforms state-of-the-art methods and thus, may support clinical decision making.
|
|
15:45-17:30, Paper WeEP-01.3 | |
Novel Blood Pressure Waveform Reconstruction from Photoplethysmography Using Cycle Generative Adversarial Networks |
|
Asgari Mehrabadi, Milad | University of California Irvine |
Aqajari, Seyed Amir Hossein | University of California, Irvine |
Afandizadeh Zargari, Amir Hosein | University of California, Irvine |
Dutt, Nikil | UC Irvine |
Rahmani, Amir M. | Department of Computer Science, University of California Irvine, |
|
|
15:45-17:30, Paper WeEP-01.4 | |
CNN-Based Two Step R Peak Detection Method: Combining Segmentation and Regression |
|
Jang, Jaeseong | HUINNO Co., Ltd |
Kim, Jin-Kook | HUINNO Co., Ltd |
An, Junho | HUINNO Co., Ltd |
Park, Seongjae | HUINNO Co., Ltd |
Jung, Sunghoon | HUINNO Co., Ltd |
Keywords: Neural networks and support vector machines in biosignal processing and classification, Data mining and big data methods - Machine learning and deep learning methods, Data mining and big data methods - Biosignal classification
Abstract: For semantic segmentation, U-Net provides an end-to-end trainable framework to detect multiple class objects from background. Due to its great achievements in computer vision tasks, U-Net has broadened its application to biomedical signal processing, especially, segmentation of waveforms in ECG signal. Despite its superior performance for QRS complex detection to other traditional signal processing methods, direct application of the U-Net to R peak detection has limitation since the U-Net structures tend to predict high probability around true peak. Such multiple detection results require additional process to determine a unique peak location in each QRS complex. In this study, we use a regression process to detect R peak instead of pixel-wise classification. Such regression process guarantees a unique peak location prediction. We collect data from resting ECG systems and wearable ECG devices as well as public ECG databases and the proposed model is trained on various combinations of the data sources. Especially, we investigate the robustness of the model for input data from the wearable devices when the model is trained by data from heterogeneous devices.
|
|
15:45-17:30, Paper WeEP-01.5 | |
Lightweight Convolutional Neural Network for Real-Time Arrhythmia Classification on Low-Power Wearable Electrocardiograph |
|
Kim, Sangkyu | HUINNO Co., Ltd |
Chon, Sangil | HUINNO Co., Ltd |
Kim, Jin-Kook | HUINNO Co., Ltd |
Kim, Joomin | HUINNO Co., Ltd |
GIL, Yeongjoon | HUINNO Inc |
Jung, Sunghoon | HUINNO Co., Ltd |
Keywords: Neural networks and support vector machines in biosignal processing and classification, Data mining and big data methods - Machine learning and deep learning methods, Data mining and big data methods - Biosignal classification
Abstract: In this study, a lightweight CNN-based Electrocardiogram (ECG) classification model is implemented to operate it on a wearable device for real-time arrhythmia detection by efficiently reducing the number of parameters of the model. Ten second-windowed ECGs from three different public ECG databases were used to learn and classify them into four classes: normal sinus rhythm, atrial fibrillation, atrial premature contraction, and ventricular premature contraction. The model implemented in the workstation environment was converted using the TensorFlow Lite framework and then imported into an ARM Cortex-M4 architecture-based nRF52840 microprocessor. The proposed model shows high performance (97.7% accuracy and 97.4% F1 score) with reasonable execution time: 298ms and current consumption: 3.55mA at optimized for speed and execution time: 480ms and current consumption: 3.82mA at optimized for size, respectively.
|
|
15:45-17:30, Paper WeEP-01.6 | |
A U-Net Deep Learning Model for Infant Heart Rate Estimation from Ballistography |
|
Prins, Wendy S | Faculty of Electrical Engineering, Eindhoven University of Techn |
Stamatelou, Elena | Goal 3 B.v |
Dellimore, Kiran | GOAL 3 |
Likumbo, Alice | Mercy James Center for Paediatric Surgery and Intensive Care, Qu |
Kafulafula, Emmanuel | Mercy James Center for Paediatric Surgery and Intensive Care, Qu |
Langton, Josephine | Mercy James Center for Paediatric Surgery and Intensive Care, Qu |
Njirammadzi, Jenala | Mercy James Center for Paediatric Surgery and Intensive Care, Qu |
Mwenisungo, Joyce | Mercy James Center for Paediatric Surgery and Intensive Care, Qu |
Msukwa, Tushapo | Mercy James Center for Paediatric Surgery and Intensive Care, Qu |
Calis, Job | Department of Paediatrics and Child Health, Kamuzu University Of |
van Sloun, Ruud | Eindhoven University of Technology |
Bierling, Bart | Eindhoven, University of Technology |
Keywords: Neural networks and support vector machines in biosignal processing and classification, Signal pattern classification, Data mining and big data methods - Machine learning and deep learning methods
Abstract: Ballistography(BSG) is a non-intrusive and low-cost alternative to electrocardiography (ECG) for heart rate (HR) monitoring in infant. Due to the inter-patient variance and susceptibility to noise, heartbeat detection in the BSG waveform remains a challenge. The aim of this study was to estimate HR from a bed-based pressure mat BSG signal using a deep learning approach. We trained a U-Net-based deep neural network through supervised learning by deriving ground truth heartbeat time window targets from simultaneously recorded ECG signals after peak matching. For improved generalization, we modified an existing U-Net to include an IC-layer. A predictive performance of 80 % was achieved using the U-Net without the IC-layer. The inclusion of the IC-layer, while improving the generalization ability of the model to detect heartbeats, did not improve the HR estimation performance.
|
|
WeEP-02 |
Hall 5 |
Theme 01. Signal Processing & Classification of Cardiovascular Signals |
Poster Session |
|
15:45-17:30, Paper WeEP-02.1 | |
Coherence Analysis between the Surface Diaphragm EMG Envelope Signal and the Respiratory Signal Derived from the ECG in Patients Assisted by Mechanical Ventilation |
|
Arboleda, Alejandro | Universidad Autónoma De Bucaramanga (UNAB) |
Franco, Manuel Hernando Franco Arias | Universidad Autónoma De Bucaramanga |
Amado, Lusvin | Universidad Autónoma De Bucaramanga - UNAB |
Naranjo, Francisco | Clínica FOSCAL, Floridablanca |
Giraldo, Beatriz | Institute for Bioengineering of Catalonia (IBEC) |
Keywords: Nonlinear dynamic analysis - Biomedical signals, Coupling and synchronization - Coherence in biomedical signal processing
Abstract: Prolonged use of mechanical ventilation (MV) can lead to greater complications for a patient. In clinical practice, it is important to identify patients who could fail in the extubation process. However, accurately predicting the outcome of this process remains a challenge. The diaphragm muscle is one of the most active elements in the breathing process. On the other hand, there are several techniques to derive respiratory information from the ECG signal. Signals derived from diaphragmatic activity and from the ECG, such as the envelope of the surface diaphragm electromyographic signal (sEMGi) and the respiratory signal derived from the electrocardiogram (ECG) could contribute to analyze the respiratory response in patients assisted by MV. This work proposes the analysis of the coherence between sEMGi and EDR signals to determine possible differences in the respiratory pattern between successful and failed patients undergoing weaning. 40 patients with MV, candidates for weaning trial process and underwent a spontaneous breathing test were analyzed, classified into: a successful group (SG: 19 patients) that maintained spontaneous breathing after the test, and a failed group (FG: 21 patients) that required reconnection to the MV. The cross correlation, power spectral density and magnitude squared coherence (MSC) of the sEMGi and the EDR signals were estimated. According to the results, the MSC parameters such as area under the curve and mean coherence value presented statistically significance differences between the two groups of patients (p = 0.024). Our results suggest that both sEMGi and EDR signals could provide information about the behavior of the respiratory system in these patients. Clinical Relevance— This study analyzes the correlation and the coherence between the envelope of the surface electromyographic signal, and the respiratory signal derived from the ECG to characterize the respiratory pattern of successful and failed patients on weaning process.
|
|
15:45-17:30, Paper WeEP-02.2 | |
Similarity Maps for Ventricular Arrhythmia Classification |
|
Lin, Qing | King's College London |
Lam, Hak-Keung | King's College London |
Curtis, Michael | King's College London |
Cvetkovic, Zoran | King's College London |
Keywords: Nonlinear dynamic analysis - Biomedical signals, Signal pattern classification, Data mining and big data methods - Biosignal classification
Abstract: Ventricular arrhythmias are the primary arrhythmias that cause sudden cardiac death. In current clinical and preclinical research, the discovery of new therapies and their translation is hampered by the lack of consistency in diagnostic criteria for distinguishing between ventricular tachycardia (VT) and ventricular fibrillation (VF). This study develops a new set of features, similarity maps, for discrimination between VT and VF using deep neural network architectures. The similarity maps are designed to capture the similarity and the regularity within an ECG trace. Our experiments show that the similarity maps lead to a substantial improvement in distinguishing VT and VF.
|
|
15:45-17:30, Paper WeEP-02.3 | |
Automatic Pain Assessment on Cancer Patients Using Physiological Signals Recorded in Real-World Contexts |
|
Moscato, Serena | University of Bologna |
Orlandi, Silvia | University of Bologna |
Giannelli, Andrea | ANT Foundation |
Ostan, Rita | ANT Foundation |
Chiari, Lorenzo | University of Bologna |
Keywords: Physiological systems modeling - Multivariate signal processing, Signal pattern classification, Neural networks and support vector machines in biosignal processing and classification
Abstract: Pain assessment represents the first fundamental stage for proper pain management, but currently, methods applied in clinical practice often lack in providing a satisfying characterization of the pain experience. Automatic methods based on the analysis of physiological signals (e.g., photoplethysmography, electrodermal activity) promise to overcome these limitations, also providing the possibility to record these signals through wearable devices, thus capturing the physiological response in everyday life. After applying pre-processing, feature extraction and feature selection methods, we tested several machine learning algorithms to develop an automatic classifier fed with physiological signals recorded in real-world contexts and pain ratings from 21 cancer patients. The best algorithm achieved up to 72% accuracy. Although performance can be improved by enlarging the dataset, preliminary results proved the feasibility of assessing pain by using physiological signals recorded in real-world contexts.
|
|
15:45-17:30, Paper WeEP-02.4 | |
Quantifying Respiration Effects on Cardiac Vibrations Using Teager Energy Operator and Gradient Boosted Trees |
|
Imirzalioglu, Mine | Koc University |
Semiz, Beren | Koc University |
Keywords: Physiological systems modeling - Signal processing in physiological systems, Nonlinear dynamic analysis - Biomedical signals, Data mining and big data methods - Machine learning and deep learning methods
Abstract: This work proposes a novel beat scoring system for quantifying the effects of exhalation and inhalation on the seismocardiogram (SCG) signals in rest and physiologically modulated conditions. Data from 19 subjects during rest, listening to classical music and recovery states were used. First, the SCG and electrocardiogram (ECG) signals were segmented into exhalation and inhalation phases using the respiration signal; and a representative SCG beat for each exhale and inhale phase was constructed using the ECG R-peak locations. Second, the significant differences across the exhalation- and inhalation-induced SCG beats were detected and extracted using the Teager-Kaiser energy operator. Finally, a gradient-based beat scoring system was developed using extreme gradient boosted trees and monotonic mapping. For the rest, classical music and recovery sessions, the area under the receiver operating characteristic curve was found to be 0.978, 0.874, 0.985, respectively. On the other hand, the kernel density estimation distributions of the inhalation and exhalation scores had an overlap of 14.2%, 41.2%, 10.6%, respectively. Overall, our results show that different physiological modulations directly change the effect of respiration on the SCG morphology, thus standardization across the beats should be studied for achieving more reliable and accurate investigation of cardiovascular parameters.
|
|
15:45-17:30, Paper WeEP-02.5 | |
Validation of Dozee, a Ballistocardiography-Based Device, for Contactless and Continuous Heart Rate and Respiratory Rate Measurement |
|
Madhu, Vibhor | Turtle Shell Technologies Private Limited |
Kumar, Ramendra | Turtle Shell Technologies Private Limited |
Kumar, Gulshan | Department of Neurophysiology National Institute of Mental Healt |
Chokalingam, Kumar | Turtle Shell Technologies Private Limited |
Rawooth, Madhusmita | Turtle Shell Technologies Private Limited |
Parchani, Gaurav | Turtle Shell Technologies Private Limited |
Keywords: Signal pattern classification, Data mining and big data methods - Machine learning and deep learning methods, Physiological systems modeling - Signal processing in physiological systems
Abstract: Long-term acquisition of respiratory and heart signals is useful in a variety of applications, including sleep analysis, monitoring of respiratory and heart disorders, and so on. Ballistocardiography (BCG), a non-invasive technique that measures micro-body vibrations caused by cardiac contractions as well as motion caused by breathing, snoring, and body movements, would be ideal for long-term vital parameter acquisition. Turtle Shell Technologies Pvt. Ltd.'s Dozee device, which is based on BCG, is a contactless continuous vital parameters monitoring system. It is designed to measure Heart Rate (HR) and Respiratory Rate (RR) continuously and without contact in a hospital setting or at home. A validation study for HR and RR was conducted using Dozee by comparing it to the vitals obtained from the FDA-approved Patient Monitor. This was done in a sleep laboratory setting over 110 nights in 51 subjects to evaluate HR and over 20 nights in 17 subjects to evaluate RR at the National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India. Approximately 789 hours data for HR and approximately 112 hours data for RR was collected. Dozee was able to achieve a mean absolute error of 1.72 bpm for HR compared to the gold standard ECG. A mean absolute error of ~1.24 breaths/min was obtained in determining RR compared to currently used methods. Dozee is ideal for long-term contactless monitoring of vital parameters due to its low mean absolute errors in measuring both HR and RR.
|
|
15:45-17:30, Paper WeEP-02.6 | |
Classification of Sleep-Wake State in Ballistocardiogram System Based on Deep Learning |
|
Ahmed, Nemath | Turtle Shell Technologies Private Limited |
KS, Srivyshnav | Turtle Shell Technologies Private Limited |
Chokalingam, Kumar | Turtle Shell Technologies Private Limited |
Rawooth, Madhusmita | Turtle Shell Technologies Private Limited |
Kumar, Gulshan | Department of Neurophysiology National Institute of Mental Healt |
Parchani, Gaurav | Turtle Shell Technologies Private Limited |
Saran, Vibhor | Turtle Shell Technologies Private Limited |
Keywords: Signal pattern classification, Data mining and big data methods - Machine learning and deep learning methods, Physiological systems modeling - Signal processing in physiological systems
Abstract: Sleep state classification is essential for managing and comprehending sleep patterns, and it is usually the first step in identifying sleep disorders. Polysomnography (PSG), the gold standard, is intrusive and inconvenient for regular/long-term sleep monitoring. Many sleep-monitoring techniques have recently seen a resurgence as a result of the rise of neural networks and advanced computing. Ballistocardiography (BCG) is an example of such a technique , in which vitals are monitored in a contactless and unobtrusive manner by measuring the body's reaction to cardiac ejection forces. A Multi-Headed Deep Neural Network is proposed in this study to accurately classify sleep-wake state and predict sleep-wake time using BCG sensors. This method achieves a 95.5% sleep-wake classification score. Two studies were conducted in a controlled and uncontrolled environment to assess the accuracy of sleep-awake time prediction. Sleep-awake time prediction achieved an accuracy score of 94.16% in a controlled environment on 115 subjects and 94.90% in an uncontrolled environment on 350 subjects. The high accuracy and contactless nature make this proposed system a convenient method for long-term monitoring of sleep states, and it may also aid in identifying sleep stages and other sleep-related disorders.
|
|
15:45-17:30, Paper WeEP-02.7 | |
Multi-Chain Semi-Markov Analysis of Intrapartum Cardiotocography |
|
Vargas-Calixto, Johann | McGill University |
Wu, Yvonne | University of California, San Francisco |
Kuzniewicz, Michael | Kaiser Permanente |
Cornet, Marie-Coralie | University of California, San Francisco |
Forquer, Heather | Kaiser Permanente |
Gerstley, Lawrence | Kaiser Permanente |
Hamilton, Emily | McGill University |
Warrick, Philip A. | Perigen Canada |
Kearney, Robert Edward | McGill University |
Keywords: Signal pattern classification - Markov models, Physiological systems modeling - Multivariate signal processing, Data mining and big data methods - Patient outcome and risk analysis
Abstract: Visual assessment of the evolution of fetal heart rate (FHR) and uterine pressure (UP) patterns is the standard of care in the intrapartum period. Unfortunately, this assessment has high levels of intra- and inter-observer variability. This study processed and analyzed FHR and UP patterns using computerized pattern recognition tools. The goal was to evaluate differences in FHR and UP patterns between fetuses with normal outcomes and those who developed hypoxic-ischemic encephalopathy (HIE). For this purpose, we modeled the sequence of FHR patterns and uterine contractions using Multi-Chain Semi-Markov models (MCSMMs). These models estimate the probability of transitioning between FHR or UP patterns and the dwell time of each pattern. Our results showed that in comparison to the control group, the HIE group had: (1) more frequent uterine contractions during the last 12 hours before birth; (2) more frequent FHR decelerations during the last 12 hours before birth; (3) longer decelerations during the last eight hours before birth; and (4) shorter baseline durations during the last five hours before birth. These results demonstrate that the fetuses in the HIE group were subject to a more stressful environment than those in the normal group.
|
|
15:45-17:30, Paper WeEP-02.8 | |
Heart Rate Detection Using Single-Channel Doppler Radar System |
|
Sameera, Jannatun Noor | University of Hawaii at Manoa |
Droitcour, Amy | Wave 80 Biosciences |
Boric-Lubecke, Olga | University of Hawaii Manoa |
Keywords: Time-frequency and time-scale analysis - Time-frequency analysis
Abstract: A number of algorithms have been developed to extract heart rate from physiological motion data with the Doppler radar system. Yet, it is very challenging to eliminate the noise associated with surroundings, especially with a single-channel Doppler radar system. However, single-channel Doppler radars provide the advantage of operating at lower power. Additionally, the heart rate extraction using single-channel Doppler radar has remained somewhat unexplored. This has motivated us to develop effective signal processing algorithms for signals received from single-channel Doppler radars. In this paper, we have proposed and studied three algorithms for estimating heart rate. The first algorithm is based on applying FFT on an FIR filtered signal. In the second algorithm, autocorrelation was performed on the filtered data. Thirdly, we used a peak finding algorithm in conjunction with a moving average preceded by a clipper to determine the heart rate. The results obtained were compared with the heart rate readings from a pulse oximeter. With a mean difference of 2.6 bpm, the heart rate from Doppler radar matched that from the pulse oximeter most frequently when the peak finding algorithm was used. The results obtained using autocorrelation and peak finding algorithm (with standard deviations of 2.6 bpm and 4.0 bpm) suggest that a single channel Doppler radar system can be a viable alternative to contact heart rate monitors in patients for whom contact measurements are not feasible.
|
|
15:45-17:30, Paper WeEP-02.9 | |
A New Framework for Modeling the Bidirectional Interplay between Brain Oscillations and Cardiac Sympathovagal Activity |
|
Candia-Rivera, Diego | Universita Di Pisa |
Catrambone, Vincenzo | Università Di Pisa |
Barbieri, Riccardo | Politecnico Di Milano |
Valenza, Gaetano | University of Pisa |
Keywords: Time-frequency and time-scale analysis - Time-frequency analysis, Physiological systems modeling - Signal processing in physiological systems, Directionality
Abstract: The study of functional brain-heart interplay (BHI) aims to describe the dynamical interactions between central and peripheral autonomic nervous systems. Here, we introduce the Sympathovagal Synthetic Data Generation Model, which constitutes a new computational framework for the assessment of functional BHI. The model estimates the bidirectional interplay with novel quantifiers of cardiac sympathovagal activity gathered from Laguerre expansions of RR series (from the ECG), as an alternative to the classical spectral analysis. The main features of the model are time-varying coupling coefficients linking Electroencephalography (EEG) oscillations and cardiac sympathetic or parasympathetic activity, for either ascending or descending direction of the information flow. In this proof-of-concept study, functional BHI is quantified in the direction from-heart-to-brain, on data from 16 human volunteers undergoing a cold-pressor test. Results show that thermal stress induces heart-to-brain functional interplay originating from sympathetic and parasympathetic activities and sustaining EEG oscillations mainly in the δ and γ bands. The proposed computational framework could provide a viable tool for the functional assessment of the causal interplay between cortical and cardiac sympathovagal dynamics.
|
|
15:45-17:30, Paper WeEP-02.10 | |
Motion-Based Respiratory Rate Estimation with Motion Artifact Removal Using Video of Face and Upper Body |
|
Gwak, Migyeong | University of California, Los Angeles |
Vatanparvar, Korosh | Samsung Research America |
Kuang, Jilong | Samsung Research America |
Gao, Alex | Samsung Research America |
Keywords: Time-frequency and time-scale analysis - Time-frequency analysis, Parametric filtering and estimation, Principal and independent component analysis - Blind source separation
Abstract: Respiratory rate (RR) is a significant indicator of health conditions. Remote contactless measurement of RR is gaining popularity with recent respiratory tract infection awareness. Among various methods of contactless RR measurement, a video of an individual can be used to obtain an instantaneous RR. In this paper, we introduce an RR estimation based on the subtle motion of the head or upper chest captured on an RGB camera. Motion-based respiratory monitoring allows us to acquire RR from individuals with partial face coverings, such as glasses or a face mask. However, motion-based RR estimation is vulnerable to the subject's voluntary movement. In this work, adaptive selection between face and chest regions plus a motion artifact removal technique enables us to obtain a much cleaner respiratory signal from the video recordings. The average mean absolute error (MAE) for controlled and natural breathing is 1.95 BPM using head motion only and 1.28 BPM using chest motion only. Our results demonstrate the possibility of continuous monitoring of breathing rate in real-time with any personal device equipped with an RGB camera, such as a laptop or a smartphone.
|
|
15:45-17:30, Paper WeEP-02.11 | |
Analysis of the Skin Conductance and Pupil Signals for Evaluation of Emotional Elicitation by Images and Sounds |
|
Polo, Edoardo Maria | Sapienza, University of Rome |
Farabbi, Andrea | Politecnico Di Milano |
Mollura, Maximiliano | Politecnico Di Milano |
Paglialonga, Alessia | CNR National Research Council of Italy |
Mainardi, Luca | Politecnico Di Milano |
Barbieri, Riccardo | Politecnico Di Milano |
Keywords: Time-frequency and time-scale analysis - Time-frequency analysis, Physiological systems modeling - Signal processing in physiological systems
Abstract: Many studies in the literature attempt recognition of emotions through the use of videos or images, but very few have explored the role that sounds have in evoking emotions. In this study we have devised an experimental protocol for elicitation of emotions by using, separately and jointly, images and sounds from the widely used International Affective Pictures System and International Affective Digital Sounds databases. During the experiments we have recorded the skin conductance and pupillary signals and processed them with the goal of extracting indices linked to the autonomic nervous system, thus revealing specific patterns of behavior depending on the different stimulation modalities. Our results show that skin conductance helps discriminate emotions along the arousal dimension, whereas features derived from the pupillary signal are able to discriminate different states along both valence and arousal dimensions. In particular, the pupillary diameter was found to be significantly greater at increasing arousal and during elicitation of negative emotions in the phases of viewing images and images with sounds. In the sound-only phase, on the other hand, the power calculated in the high and very high frequency bands of the pupillary diameter were significantly greater at higher valence (valence ratings > 5).
|
|
WeEP-03 |
Hall 5 |
Theme 01. Signal Processing & Classification of Speech and Acoustic Signals |
Poster Session |
Chair: Bondareva, Erika | University of Cambridge |
|
15:45-17:30, Paper WeEP-03.1 | |
A Deep Learning Based Approach to Synthesize Intelligible Speech with Limited Temporal Envelope Information |
|
Hsiao, Ching-Ju | National Yang Ming Chiao Tung University |
Chen, Fei | Southern University of Science and Technology |
Han, Ji Yan | Yang Ming |
Zheng, Wei-Zhong | National Yang Ming University |
Lai, Ying-Hui | National Yang Ming Chiao Tung University |
Keywords: Neural networks and support vector machines in biosignal processing and classification
Abstract: Envelope waveforms can be extracted from multiple frequency bands of a speech signal, and envelope waveforms carry important intelligibility information for human speech communication. This study aimed to investigate whether a deep learning-based model with features of temporal envelope information could synthesize an intelligible speech, and to study the effect of reducing the number (from 8 to 2 in this work) of temporal envelope information on the intelligibility of the synthesized speech. The objective evaluation metric of short-time objective intelligibility (STOI) showed that, on average, the synthesized speech of the proposed approach provided higher STOI (i.e., 0.8) scores in each test condition; and the human listening test showed that the average word correct rate of eight listeners was higher than 97.5%. These findings indicated that the proposed deep learning-based system can be a potential approach to synthesize a highly intelligible speech with limited envelope information in the future.
|
|
15:45-17:30, Paper WeEP-03.2 | |
Decoding Neural Correlation of Language-Specific Imagined Speech Using EEG Signals |
|
Lee, Keon-Woo | Korea University |
Lee, Dae-Hyeok | Korea University |
Kim, Sung-Jin | Korea University |
Lee, Seong-Whan | Korea University |
Keywords: Nonlinear dynamic analysis - Biomedical signals, Physiological systems modeling - Signal processing in physiological systems, Time-frequency and time-scale analysis - Time-frequency analysis
Abstract: Speech impairments due to cerebral lesions and degenerative disorders can be devastating. For humans with severe speech deficits, imagined speech in the brain-computer interface has been a promising hope for reconstructing the neural signals of speech production. However, studies in the EEG-based imagined speech domain still have some limitations due to high variability in spatial and temporal information and low signal-to-noise ratio. In this paper, we investigated the neural signals for two groups of native speakers with two tasks with different languages, English and Chinese. Our assumption was that English, a non-tonal and phonogram-based language, would have spectral differences in neural computation compared to Chinese, a tonal and ideogram-based language. The results showed the significant difference in the relative power spectral density between English and Chinese in specific frequency band groups. Also, the spatial evaluation of Chinese native speakers in the theta band was distinctive during the imagination task. Hence, this paper would suggest the key spectral and spatial information of word imagination with specialized language while decoding the neural signals of speech.
|
|
15:45-17:30, Paper WeEP-03.3 | |
Language-Independent Sleepy Speech Detection |
|
Moon, Jihye | University of Connecticut |
Kong, Youngsun | University of Connecticut |
Chon, Ki | University of Connecticut |
Keywords: Data mining and big data methods - Biosignal classification, Signal pattern classification, Data mining and big data methods - Machine learning and deep learning methods
Abstract: Prolonged sleepiness can lead to impairment of cognitive and physical performance and may cause unfortunate accidents. Speech signals are easily accessible using a simple microphone or other means, hence, automated approaches for accurate sleepiness detection from speech signals are desired to prevent degradation in human performance and accidental injury. Sleepiness is known to affect acoustic patterns of speech so that they are different from those of normal speech, and this change is also independent of the language being spoken. To date, there have been no studies examining linguistic-independent sleepy speech detection. We used two different languages, English and German, to detect sleepy speech, where the former was used to train/validate and the latter to test the effectiveness of machine and deep learning models. Specifically, we trained ResNet50, a deep learning model, and five machine learning models with relevant vocal features. Speech data segments from three English-speaking subjects were used for training the model and segments from an English-speaking subject were used for validation. We then tested ResNet50 and the five different machine-learning models using speech data segments from one German-speaking subject. Deep learning far outperformed all of the machine learning approaches. The accuracy, sensitivity, specificity, and geometric mean values were found to be 0.96, 0.92, 0.99, and 0.95, respectively, using ResNet50 on the test data. Our preliminary results suggest that sleepiness can be accurately detected independently from linguistic speech.
|
|
15:45-17:30, Paper WeEP-03.4 | |
Stress Inference from Abdominal Sounds Using Machine Learning |
|
Bondareva, Erika | University of Cambridge |
Constantinides, Marios | Nokia Bell Labs |
Eggleston, Michael S. | Nokia Bell Labs |
Jabłoński, Ireneusz | Wroclaw University of Technology |
Mascolo, Cecilia | University of Cambridge |
Radivojevic, Zoran | Nokia Bell Labs |
Šćepanović, Sanja | Nokia Bell Labs |
Keywords: Neural networks and support vector machines in biosignal processing and classification, Data mining and big data methods - Biosignal classification
Abstract: Stress is often considered the 21st century's epidemic, affecting more than a third of the globe's population. Long-term exposure to stress has significant side effects on physical and mental health. In this work we propose a methodology for detecting stress using abdominal sounds. For this study, eight participants were either exposed to a stressful (Stroop test) or a relaxing (guided meditation) stimulus for ten days. In total, we collected 104 hours of abdominal sounds using a custom wearable device in a belt form-factor. We explored the effect of various features on the binary stress classification accuracy using traditional machine learning methods. Namely, we observed the impact of using acoustic features on their own, as well as in combination with features representing current mood state, and hand-crafted domain-specific features. After feature extraction and reduction, by utilising a multilayer perceptron classifier model we achieved 77% accuracy in detecting abdominal sounds under stress exposure. Clinical relevance – This feasibility study confirms the link between the gastrointestinal system and stress and uncovers a novel approach for stress inference via abdominal sounds using machine learning.
|
|
15:45-17:30, Paper WeEP-03.5 | |
The Robustness of Random Forest and Support Vector Machine Algorithms to a Faulty Heart Sound Segmentation |
|
Oliveira, Jorge | Universidade Portucalense Infante D. Henrique |
Nogueira, Diogo Marcelo | INESC TEC |
Alípio, Jorge | INESC-TEC |
Ferreira, Carlos | Liaad - Inesc Tec |
Coimbra, Miguel | INESC TEC / Universidade Do Porto |
Keywords: Physiological systems modeling - Signal processing in physiological systems, Nonlinear dynamic analysis - Biomedical signals, Neural networks and support vector machines in biosignal processing and classification
Abstract: Cardiac auscultation is the key exam to screen cardiac diseases both in developed and developing countries. A heart sound auscultation procedure can detect the presence of murmurs and point to a diagnosis, thus it is an important first-line assessment and also cost-effective tool. The design automatic recommendation systems based on heart sound auscultation can play an important role in boosting the accuracy and the pervasiveness of screening tools. One such as step, consists in detecting the fundamental heart sound states, a process known as segmentation. A faulty segmentation or a wrong estimation of the heart rate might result in an incapability of heart sound classifiers to detect abnormal waves, such as murmurs. In the process of understanding the impact of a faulty segmentation, several common heart sound segmentation errors are studied in detail, namely those where the heart rate is badly estimated and those where S1/S2 and Systolic/Diastolic states are swapped in comparison with the ground truth state sequence. From the tested algorithms, support vector machine (SVMs) and random forest (RFs) shown to be more sensitive to a wrong estimation of the heart rate (an expected drop of 6% and 8% on the overall performance, respectively) than to a swap in the state sequence of events (an expected drop of 1.9% and 4.6%, respectively).
|
|
15:45-17:30, Paper WeEP-03.6 | |
Automatic Identification of Snoring and Groaning Segments in Acoustic Recordings |
|
Kok, Xuen Hoong | Imperial College London |
Imtiaz, Syed Anas | Imperial College London |
Rodriguez-Villegas, Esther | Imperial College London |
Keywords: Signal pattern classification, Time-frequency and time-scale analysis - Time-frequency analysis
Abstract: Sleep-related breathing disorders have severe impact on the quality of lives of those suffering from them. These disorders present with a variety of symptoms, out of which snoring and groaning are very common. This paper presents an algorithm to identify and classify segments of acoustic respiratory sound recordings that contain both groaning and snoring events. The recordings were obtained from a database containing 20 subjects from which features based on the Mel-frequency cepstral coefficients (MFCC) were extracted. In the first stage of the algorithm, segments of recordings consisting of either snoring or groaning episodes - without classifying them - were identified. In the second stage, these segments were further differentiated into individual groaning or snoring events. The algorithm in the first stage achieved a sensitivity and specificity of 90.5% ± 2.9% and 90.0% ± 1.6% respectively, using a RUSBoost model. In the second stage, a random forest classifier was used, and the accuracies for groan and snore events were 78.1% ± 4.7% and 78.4% ± 4.7% respectively.
|
|
15:45-17:30, Paper WeEP-03.7 | |
Synchrosqueezed Transform Based Click Event Segmentation in Phonocardiogram of Mitral Valve Prolapse |
|
BS, Rajeshwari | Indian Institute of Technology, Kharagpur |
Sinha, Aman | IIT Kharagpur |
Sengupta, Arnab | University College London |
Patra, Madhurima | Kalyani Government Engineering College |
Sahoo, Karuna Prasad | Indian Institute of Technology, Kharagpur |
Ghosh, Nirmalya | Indian Institute of Technology (IIT), Kharagpur |
Keywords: Time-frequency and time-scale analysis - Time-frequency analysis, Neural networks and support vector machines in biosignal processing and classification
Abstract: Phonocardiogram (PCG) signal of the mitral valve prolapse (MVP) patients is characterized by transient audio events which include a systolic click (SC) followed by a murmur of varying intensity. Physicians detect these auscultation clues in regular auscultation before ordering expensive echocardiography test. But auscultation is often error prone and even physicians with considerable experience might end up missing these clues. Therefore developing machine learning techniques to help clinicians is the need of the hour. A segmentation technique using Fourier synchrosqueezed transform (FSST) features with a long short term memory (LSTM) network is proposed in this study. An accuracy of 99:8% on MVP dataset demonstrates the potential of the proposed method in clinical diagnosis.
|
|
WeEP-04 |
Hall 5 |
Theme 01. Signal Processing and Classification of Cardiac Signals |
Poster Session |
|
15:45-17:30, Paper WeEP-04.1 | |
Normal and Abnormal Classification of Electrocardiogram: A Primary Screening Tool Kit |
|
Kirodiwal, Akash | Indian Institute of Technology, Kharagpur |
DHALADHULI, JAHNAVI | Indian Institute of Technology Kharagpur |
DASH, ASHUTOSH | Indian Institute of Technology, Kharagpur |
Ghosh, Nirmalya | Indian Institute of Technology (IIT), Kharagpur |
Patra, Amit | Indian Institute of Technology Kharagpur |
Keywords: Data mining and big data methods - Biosignal classification, Data mining and big data methods - Machine learning and deep learning methods, Physiological systems modeling - Signal processing in physiological systems
Abstract: Cardiovascular diseases (CVDs) are one of the principal causes of death. Cardiac arrhythmia, a critical CVD, can be easily detected from an electrocardiogram (ECG) recording. Automated ECG analysis can help clinicians to identify arrhythmia and prevent untimely death. This paper presents a simple model to classify the ECG recordings into two classes: Normal and Abnormal based on morphological and heart rate variability (HRV) features. Before feature extraction, Signal quality analysis (SQA) is performed to abandon poor quality ECG signals. Several machine-learning classifiers such as Support Vector Machine (SVM), Adaboost (AB), Random Forest(RF), Extra-Tree Classifier (ET), Decision Tree (DT), Artificial Neural Network (ANN), K-Nearest Neighbors (KNN), Logistic Regression (LR), Naïve Bayes (NB), and Gradient Boosting (GB) are explored on the extracted feature space. To enhance the study, few feature selection algorithms such as F test, Least Absolute Shrinkage and Selection Operator (LASSO), and Minimal Redundancy Maximal Relevance (mRMR) algorithms are also applied and the outcomes of each algorithm along with the considered classifiers are analyzed and compared. The proposed algorithm is validated on 2648 Normal and 2518 Abnormal ECG recordings. The accuracy of our best classifier is found to be 95.25 %. It is anticipated that the proposed model will be helpful as a primary and mass screening tool kit in clinical settings.
|
|
15:45-17:30, Paper WeEP-04.2 | |
A Novel ECG Denoising Scheme Using the Ensemble Kalman Filter |
|
Sarafan, Sadaf | University of California, Irvine |
Hoang Vuong, Hoang | SENSORIIS, INC |
Jilani, Daniel | University of California, Irvine |
Malhotra, Samir | University of California, Irvine |
Vishwanath, Manoj | Donald Bren School of Information and Computer Sciences, UC Irvi |
Lau, Michael | SENSORIIS, INC |
Ghirmai, Tadesse | University of Washington Bothell |
Cao, Hung | University of California, Irvine |
Keywords: Kalman filtering, Nonlinear dynamic analysis - Nonlinear filtering, Physiological systems modeling - Signal processing in physiological systems
Abstract: Monitoring of electrocardiogram (ECG) provides vital information as well as any cardiovascular anomalies. Recent advances in the technology of wearable electronics have enabled compact devices to acquire personal physiological signals in the home setting; however, signals are usually contaminated with high-level noise. Thus, an efficient ECG filtering scheme is a dire need. In this paper, a novel method using Ensemble Kalman Filter (EnKF) is developed for denoising ECG signals. We also intensively explore various filtering algorithms, including Savitzky-Golay (SG) filter, Ensemble Empirical mode decomposition (EEMD), Normalized Least-Mean-Square (NLMS), Recursive least squares (RLS) filter, Total variation denoising (TVD), Wavelet and extended Kalman filter (EKF) for comparison. Data from the MIT-BIH Noise Stress Test database were used. The proposed methodology shows the average signal-to-noise ratio of 10.96, the Percentage Root Difference of 150.45, and the correlation coefficient of 0.959 from the modified MIT-BIH database with added motion artifacts. Our results demonstrate that the proposed algorithm is reliable and effective as it could provide original ECG signals even under strong noise conditions.
|
|
15:45-17:30, Paper WeEP-04.3 | |
A Generalized Framework for Pacing Artifact Removal Using a Hampel Filter |
|
Nagahawatte, Nipuni | The University of Auckland |
Cheng, Leo K | The University of Auckland |
Avci, Recep | The University of Auckland |
Bear, Laura R | IHU-Liryc, Fondation Bordeaux Université |
Paskaranandavadivel, Niranchan | The University OfAuckland |
Keywords: Physiological systems modeling - Signal processing in physiological systems, Parametric filtering and estimation, Time-frequency and time-scale analysis - Time-frequency analysis
Abstract: Cardiac pacing is a clinical therapy widely used for treating irregular heart rhythms. Equivalent techniques for the treatment of gastric functional motility disorders hold great potential. Accurate analysis of pacing studies is often hindered by the stimulus artifacts which are superimposed on the recorded signals. This paper presents a semi-automated artifact detection method using a Hampel filter accompanied by 2 separate artifact reconstruction methods: (i) weighted mean and (ii) an auto-regressive model, to estimate the underlying signal. The developed framework was validated on synthetic and experimental signals containing large periodic pacing artifacts alongside evoked bioelectrical events. The performance of the proposed algorithms was quantified for gastric and cardiac pacing data collected in vivo. A lower mean RMS difference was achieved by the artifact segment reconstructed using the auto regression (0.37 ± 0.49 mV) method compared to the weighted mean (0.69 ± 0.69 mV) method. Therefore, a more accurate artifact reconstruction was provided by the auto regression approach.
|
|
15:45-17:30, Paper WeEP-04.4 | |
A Stochastic Resonance P and T-Wave Detection Algorithm |
|
Güngör, Cihan Berk | University of California San Diego |
Mercier, Patrick P. | University of California, San Diego |
Töreyin, Hakan | San Diego State University |
Keywords: Physiological systems modeling - Signal processing in physiological systems, Physiological systems modeling - Signal processing in simulation, Nonlinear dynamic analysis - Biomedical signals
Abstract: An algorithm to detect P- and T-waves in an electrocardiogram (ECG) signal is presented. The algorithm has physical origins inspired by weak signal detection by leveraging stochastic resonance (SR) in a well potential. Specifically, a particle inside an underdamped monostable well is introduced with the ECG signal. The parameters defining the well and system characteristics are optimized towards enhancing the P-, R-, and T-waves while suppressing the other portions including noise-only sections. The enhanced features are detected by thresholding. Based on the performance obtained from the QT database, the algorithm achieves an average sensitivity of 99.97% for P-waves and an average sensitivity of 99.35% for T-waves, better than most P- and T-wave detection algorithms reported.
|
|
15:45-17:30, Paper WeEP-04.5 | |
Fetal Movement Cancellation in Abdominal Electrocardiogram Recordings Using Signal-To-Signal Translation |
|
Arash Shokouhmand, Arash | Stevens Institute of Technology |
Tavassolian, Negar | Stevens Institute of Technology |
Keywords: Physiological systems modeling - Systems identification, Signal pattern classification, Neural networks and support vector machines in biosignal processing and classification
Abstract: Abstract— This study addresses the cancellation of fetal movement in abdominal electrocardiogram (AECG) recordings through deep neural networks. For this purpose, a generative signal-to-signal translation model consisting of two coupled generators is employed to discover the relations between fetal movement-contaminated and clean AECG recordings. The model is trained on the fetal ECG synthetic database (FECGSYNDB) which provides AECG recordings from 10 pregnancies along with their ground-truth maternal and fetal ECG signals. The signals are initially segmented into 4-second segments and then fed into the network for denoising. It is demonstrated that the signal-to-signal translation method can reconstruct clean AECG signals with average mean-absolute-error (MAE), root-mean-square deviation (RMSD), and Pearson correlation coefficient (PCC) of 0.099, 0.124, and 99.12% respectively, between clean and denoised AECG signals. Furthermore, the robustness of the method to low signal-to-noise ratio (SNR) input values is shown by an RMSD range of (0.047, 0.352) for SNR values within the range of (-3, 3) dB. Clinical Relevance— The proposed framework allows for the denoising of abdominal ECG signals for non-invasive fetal heart rate monitoring. The approach is accurate due to the use of advanced neural network techniques.
|
|
15:45-17:30, Paper WeEP-04.6 | |
Morphological Event Based Signal Quality Assessment of Electrocardiogram |
|
Agrawal, Akshat | Indian Institute of Technology Kharagpur |
DASH, ASHUTOSH | Indian Institute of Technology, Kharagpur |
Ghosh, Nirmalya | Indian Institute of Technology (IIT), Kharagpur |
Patra, Amit | Indian Institute of Technology Kharagpur |
Keywords: Time-frequency and time-scale analysis - Nonstationary analysis and modeling, Nonlinear dynamic analysis - Biomedical signals, Signal pattern classification
Abstract: ECG signals acquired from mobile devices by unskilled users are corrupted with several noises. Poor signal quality may result in an increased number of false alarms, degrading diagnostic performance, and increasing the burden on the doctors in decoding the information for further clinical intervention. So, it is necessary to assess the quality of the signals before doing any further processing. This paper presents a method for accessing the reliability of ECG signals obtained from wearable sensors. A morphological event-based quality assessment method is proposed where a signal will be classified as GOOD/BAD. Results show that our method can achieve an accuracy = 92% with sensitivity = 0.98 and specificity = 0.59.
|
|
WeEP-05 |
Hall 5 |
Theme 02. Image Analysis and Classification - Machine Learning / Deep
Learning Approaches - P2 |
Poster Session |
|
15:45-17:30, Paper WeEP-05.1 | |
Supervised and Semi-Supervised Training of Deep Convolutional Neural Networks for Gastric Landmark Detection |
|
Lopes, Inês Filipa Fernandes Videira | INESC TEC Porto |
Silva, Augusto | University of Aveiro |
Coimbra, Miguel | INESC TEC / Universidade Do Porto |
Dinis-Ribeiro, Mário | Instituto Português De Oncologia - Porto |
Libânio, Diogo | CIDES/CINTEIS, Faculty of Medicine, University of Porto |
Renna, Francesco | INESC TEC E Faculdade De Ciências Da Universidade Do Porto |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification, Image feature extraction
Abstract: This work focuses on detection of upper gastrointestinal (GI) landmarks, which are important anatomical areas of the upper GI tract digestive system that should be photodocumented during endoscopy to guarantee a complete examination. The aim of this work consisted in testing new automatic algorithms, specifically based on convolutional neural network (CNN) systems, able to detect upper GI landmarks, that can help to avoid the presence of blind spots during esophagogastroduodenoscopy. We tested pre-trained CNN architectures, such as the ResNet-50 and VGG-16, in conjunction with different training approaches, including the use of class weights, batch normalization, dropout, and data augmentation. The ResNet-50 model trained with class weights was the best performing CNN, achieving an accuracy of 71.79% and a Mathews Correlation Coefficient (MCC) of 65.06%. The combination of supervised and unsupervised learning was also explored to increase classification performance. In particular, convolutional autoencoder architectures trained with unlabeled GI images were used to extract representative features. Such features were then concatenated with those extracted by the pre-trained ResNet-50 architecture. This approach achieved a classification accuracy of 72.45% and an MCC of 65.08%.
|
|
15:45-17:30, Paper WeEP-05.2 | |
Prediction of Human Induced Pluripotent Stem Cell Formation Based on Deep Learning Analyses Using Time-Lapse Brightfield Microscopy Images |
|
Chu, Slo-Li | Chung Yuan Christian University |
Sudo, Kazuhiro | BioResouce Center, RIKEN |
Abe, Kuniya | Mammalian Genome Dynamics, RIKEN BioResource Center |
Yokota, Hideo | RIKEN Center for Advanced Photonics |
Nakamura, Yukio | RIKEN BioResource Center |
Liou, Guan-Ting | Chung Yuan Christian University |
Tsai, Ming-Dar | Chung-Yuan Christian University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image segmentation, Optical imaging and microscopy - Microscopy
Abstract: We use deep learning methods to predict human induced pluripotent stem cell (hiPSC) formation using time-lapse brightfield microscopy images taken from a cell identified as the beginning of entered into the reprogramming process. A U-net is used to segment cells and a CNN is used to classify the segmented cells into eight types of cells during the reprogramming and hiPSC formation based on cellular morphology on the microscopy images. The numbers of respective types of cells in cell clusters before the hiPSC formation stage are used to predict if hiPSC regions can be well formed lately. Experimental results show good prediction by the criteria using the numbers of different cells in the clusters. Time-series images with respective types of classified cells can be used to visualize and quantitatively analyze the growth and transition among dispersed cells not in cell clusters, various types of cells in the clusters before the hiPSC formation stage and hiPSC cells.
|
|
15:45-17:30, Paper WeEP-05.3 | |
Synthesizing 3D Lung CT Scans with Generative Adversarial Networks |
|
Ferreira, Artur | INESC TEC - Institute for Systems and Computer Engineering, Tech |
Pereira, Tania | INESC TEC - Institute for Systems and Computer Engineering, Tech |
Silva, Francisco | INESC TEC |
Ana Teresa, Vilares | CHUSJ - Centro Hospitalar E Universitário De São João |
Miguel Correia, Silva | CHUSJ - Centro Hospitalar E Universitário De São João |
Cunha, António | Universidade De Trás-Os-Montes E Alto Douro & INESC Tecnologia E |
Oliveira, Hélder P. | INESC TEC, Faculdade De Ciências, Universidade Do Porto |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, CT imaging, CT imaging applications
Abstract: In the healthcare domain, datasets are often private and lack large amounts of samples, making it difficult to cope with the inherent patient data heterogeneity. As an attempt to mitigate data scarcity, generative models are being used due to their ability to produce new data, using a dataset as a reference. However, synthesis studies often rely on a 2D representation of data, a seriously limited form of information when it comes to lung computed tomography scans where, for example, pathologies like nodules can manifest anywhere in the organ. Here, we develop a 3D Progressive Growing Generative Adversarial Network capable of generating thoracic CT volumes at a resolution of (128^3), and analyze the model outputs through a quantitative metric (3D Muli-Scale Structural Similarity) and a Visual Turing Test. Clinical relevance — This paper is a novel application of the 3D PGGAN model to synthesize CT lung scans. This preliminary study focuses on synthesizing the entire volume of the lung, rather than just the lung nodules. The synthesized data represent an attempt to mitigate data scarcity, which is one of the major limitations to create learning models with good generalization in healthcare.
|
|
15:45-17:30, Paper WeEP-05.4 | |
Unsupervised Approach for Malignancy Assessment of Lung Nodules in Computed Tomography Scans Using Radiomic Features |
|
Teixeira, Marco | INESC TEC - Institute for Systems and Computer Engineering, Tech |
Pereira, Tania | INESC TEC - Institute for Systems and Computer Engineering, Tech |
Silva, Francisco | INESC TEC |
Cunha, António | Universidade De Trás-Os-Montes E Alto Douro & INESC Tecnologia E |
Oliveira, Hélder P. | INESC TEC, Faculdade De Ciências, Universidade Do Porto |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image feature extraction, CT imaging applications
Abstract: Lung cancer is the leading cause of cancer death worldwide. Early low-dose computed tomography (CT) screening can decrease its mortality rate and computer-aided diagnoses systems may make these screenings more accessible. Radiomic features and supervised machine learning have traditionally been employed in these systems. Contrary to supervised methods, unsupervised learning techniques do not require large amounts of annotated data which are labor-intensive to gather and long training times. Therefore, recent approaches have used unsupervised methods, such as clustering, to improve the performance of supervised models. However, an analysis of purely unsupervised methods for malignancy prediction of lung nodules from CT images has not been performed. This work studies nodule malignancy in the LIDC-IDRI image collection of chest CT scans using established radiomic features and unsupervised learning methods based on k-Means, Spectral Clustering, and Gaussian Mixture clustering. All tested methods resulted in clusters of high homogeneity malignancy. Results suggest convex feature distributions and well-separated feature subspaces associated with different diagnoses. Furthermore, diagnosis uncertainty may be explained by common characteristics captured by radiomic features. The k-Means and Gaussian Mixture models are able to generalize to unseen data, achieving a balanced accuracy of 87.23% and 86.96% when inference was tested. These results motivate the usage of unsupervised approaches for malignancy prediction of lung nodules, such as cluster-then-label models.
|
|
15:45-17:30, Paper WeEP-05.5 | |
Computer-Aided Detection of Lesions from Coronary Angiograms Based on Contrast Learning Technique |
|
Pal, Poulomi | Indian Institute of Technology, Kharagpur |
Dey, Sumagna | Meghnad Saha Institute of Technology |
Mahadevappa, Manjunatha | Indian Institute of Technolgy Kharagpur |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Cardiac imaging and image analysis, Image feature extraction
Abstract: Coronary artery disease is one of the prevalent cardiovascular diseases in the world. Clinically, coronary artery angiography (CAG) is the most efficient diagnostic tool for detecting the stenosis caused by the presence of coronary lesions. Here, we proposed a simple but efficient methodology for predicting the coronary arterial block. The technique of classifying the angiograms collected from 369 patients is implemented using the contrast learning approach. ResNet 152 V2 is used as the deep network. Region of interest (ROI) is found for the diseased arteries for deciding the type of treatment procedure. Four different losses were implemented in this two-level classification technique. This framework achieved an accuracy of 0.81 recall of 0.76, precision of 0.86, specificity of 0.87, F-score of 0.80, and CSI of 0.67. A comparative study with the state-of-art is carried out to establish the advantage of the proposed method. This technique could be used by the clinicians.
|
|
15:45-17:30, Paper WeEP-05.6 | |
Novel No-Reference Multi-Dimensional Perceptual Similarity Metric |
|
Tirunellai Rajamani, Srividya | University of Augsburg |
Rajamani, Kumar | BNM Institute of Technology |
Rani, Priya | Applied Artificial Intelligence Institute, Deakin University |
Barick, Rashmita | Department of Computer Science, BNM Institute of Technology, Ban |
Mysore Sheshadri, Ramasubramanya | Department of Computer Science, BNM Institute of Technology, Ban |
V Aithal, Sridevi | Department of Computer Science, BNM Institute of Technology, Ban |
Elagiri Ramalingam, Rajkumar | Chief Information Officer, Indian Institute of Technology, Madra |
D Gowda, Sahana | Associate Dean, School of Computer Science and Engineering, RV U |
Schuller, Bjoern | Imperial College London |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification, Image enhancement
Abstract: Enormous progress has been made in the domain of determining image quality. However, even the recently proposed deep learning based perceptual quality metrics and the classical structural similarity metric (SSIM) are not designed to operate in the absence of a good quality reference image. Many of the image acquisition processes, especially in medical imaging, would immensely benefit from a metric that can indicate if the quality of an image is improving or worsening based on adaptation of the acquisition parameters. In this work, we propose a novel multi-dimensional no-reference perceptual similarity metric that can compute the quality of a given image without a reference pristine quality image by combining no-reference image quality metric (PIQUE) and perceptual similarity. The dimensions of quality currently explored are in the axis of noise, blur and contrast. Our experiments demonstrate that our proposed novel no-reference perceptual similarity metric correlates very well with the quality of an image in a multi-dimensional sense.
|
|
15:45-17:30, Paper WeEP-05.7 | |
Addressing the Intra-Class Mode Collapse Problem Using Adaptive Input Image Normalization in GAN-Based X-Ray Images |
|
Saad, Muhammad Muneeb | Munster Technological University Cork |
Rehmani, Mubashir Husain | Munster Technological University Cork |
O'Reilly, Ruairi | Munster Technological University Cork |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image reconstruction and enhancement - Image synthesis, X-ray radiography
Abstract: Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imagery. Furthermore, the absence of diverse features in synthetic images can degrade the performance of machine learning classifiers. The mode collapse problem impacts Generative Adversarial Networks’ capacity to generate diversified images. Mode collapse comes in two varieties: intra-class and inter-class. In this paper, the intra-class mode collapse problem is investigated, and its subsequent impact on the diversity of synthetic X-ray images is evaluated. This work contributes an empirical demonstration of the benefits of integrating the adaptive input-image normalization for the Deep Convolutional GAN to alleviate the intra-class mode collapse problem. Results demonstrate that the DCGAN with adaptive input-image normalization outperforms DCGAN with un-normalized X-ray images as evident by the superior diversity scores.
|
|
15:45-17:30, Paper WeEP-05.8 | |
Multimodal Diagnosis for Pulmonary Embolism from EHR Data and CT Images |
|
ZHI, Zhuo | University College London |
Elbadawi, Moe | University College London |
Daneshmend, Adam | Imperial Hospitals NHS Trust |
Orlu, Mine | UCL |
Basit, Abdul Waseh | University College London |
Demosthenous, Andreas | University College London |
Rodrigues, Miguel R. D. | University College London |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, CT imaging
Abstract: Pulmonary Embolism (PE) is a severe medical condition that can pose a significant risk to life. Traditional deep learning methods for PE diagnosis are based on Computed Tomography (CT) images and do not consider the patient's clinical context. To make full use of patient’s clinical information, this article presents a multimodal fusion model ingesting Electronic Health Record (EHR) data and CT images for PE diagnosis. The proposed model is based on multilayer perception and convolutional neural networks. To remove the invalid information in the EHR data, the multidimensional scaling algorithm is performed for feature dimension reduction. The EHR data and CT images of 600 patients are used for experiments. The experiment results show that the proposed models outperform existing methods and the multimodal fusion model shows better performance than the single-input model.
|
|
15:45-17:30, Paper WeEP-05.9 | |
Ensemble Deep Learning Methods for Cell Detection and Tracking (withdrawn from program) |
|
Toubal, Imad | University of Missouri |
Al-Shakarji, Noor | University of Missouri Columbia |
Palaniappan, Kannappan | University of Missouri-Columbia |
|
15:45-17:30, Paper WeEP-05.10 | |
Wasserstein GAN Based Chest X-Ray Dataset Augmentation for Deep Learning Models: COVID-19 Detection Use-Case |
|
Hussain, B. Zahid | Aligarh Muslim University |
Andleeb, Ifrah | Aligarh Muslim University |
Ansari, Mohammad Samar | University of Chester |
Joshi, Amit M. | Malaviya National Institute of Technology |
Kanwal, Nadia | Keele University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Machine learning / Deep learning approaches, X-ray imaging applications
Abstract: The novel coronavirus infection (COVID-19) is still continuing to be a concern for the entire globe. Since early detection of COVID-19 is of particular importance, there have been multiple research efforts to supplement the current standard RT-PCR tests. Several deep learning models, with varying effectiveness, using Chest X-Ray images for such diagnosis have also been proposed. While some of the models are quite promising, there still remains a dearth of training data for such deep learning models. The present paper attempts to provide a viable solution to the problem of data deficiency in COVID-19 CXR images. We show that the use of a Wasserstein Generative Adversarial Network (WGAN) could lead to an effective and lightweight solution. It is demonstrated that the WGAN generated images are at par with the original images using inference tests on an already proposed COVID-19 detection deep learning model.
|
|
15:45-17:30, Paper WeEP-05.11 | |
CCNET: Cross Coordinate Network for Joint Diabetic Retinopathy and Diabetic Macular Edema Grading |
|
Yue, Taotao | Tsinghua University |
Yang, Wenming | Tsinghua University |
Liao, Qingmin | Tsinghua University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification, Image feature extraction
Abstract: With the rapid development of the world economy and increasing improvement of people's living standards, the number of diabetic patients has been growing quickly. Meanwhile, the complications of diabetes especially retinopathy have been affecting their daily life seriously. The only way to prevent it from getting worse and even leading to blindness is to make corresponding diagnosis as early as possible. However, it's extremely impossible for professionals to diagnose all the patients through their fundus images. It couldn't be better to solve the problem by automatic systems, so we present a novel network to learn the features of diabetic retinopathy (DR) and its complication diabetic macular edema (DME) and the relationship between them, focus on some vital areas in the pictures and eventually obtain the grades of the two diseases at the same time. Experimental results further prove the effectiveness of our proposed module comparing to the only joint grading network before.
|
|
WeEP-06 |
Hall 5 |
Theme 02. Image Reconstruction & Enhancement |
Poster Session |
Chair: Choi, Kihwan | Korea Institute of Science and Technology |
|
15:45-17:30, Paper WeEP-06.1 | |
Fast MRI Reconstruction: How Powerful Transformers Are? |
|
Huang, Jiahao | Imperial College London |
Wu, Yinzhe | Imperial College London |
Wu, Huanjun | Imperial College London |
Yang, Guang | Imperial College London |
Keywords: Image reconstruction - Fast algorithms, Image reconstruction and enhancement - Machine learning / Deep learning approaches, Magnetic resonance imaging - MR neuroimaging
Abstract: Magnetic resonance imaging (MRI) is a widely used non-radiative and non-invasive method for clinical interrogation of organ structures and metabolism, with an inherently long scanning time. Methods by k-space undersampling and deep learning based reconstruction have been popularised to accelerate the scanning process. This work focuses on investigating how powerful transformers are for fast MRI by exploiting and comparing different novel network architectures. In particular, a generative adversarial network (GAN) based Swin transformer (ST-GAN) was introduced for the fast MRI reconstruction. To further preserve the edge and texture information, edge enhanced GAN based Swin transformer (EES-GAN) and texture enhanced GAN based Swin transformer (TES-GAN) were also developed, where a dual-discriminator GAN structure was applied. We compared our proposed GAN based transformers, standalone Swin transformer and other convolutional neural networks based GAN model in terms of the evaluation metrics PSNR, SSIM and FID. We showed that transformers work well for the MRI reconstruction from different undersampling conditions. The utilisation of GAN's adversarial structure improves the quality of images reconstructed when undersampled for 30% or higher. The code is publicly available at https://github.com/ayanglab/SwinGANMR.
|
|
15:45-17:30, Paper WeEP-06.2 | |
Grad-CAM Guided U-Net for MRI-Based Pseudo-CT Synthesis |
|
Dovletov, Gurbandurdy | University of Duisburg-Essen |
Pham, Duc Duy | University of Duisburg-Essen |
Lörcks, Stefan | University of Duisburg-Essen |
Pauli, Josef | Duisburg-Essen, Intelligente Systeme |
Gratz, Marcel | High-Field and Hybrid MR Imaging, University Hospital Essen, Uni |
Quick, Harald H. | High-Field and Hybrid MR Imaging, University Hospital Essen, Uni |
Keywords: Image reconstruction and enhancement - Machine learning / Deep learning approaches, Image reconstruction and enhancement - Image synthesis
Abstract: In this paper, we address the task of image-to-image translation from MRI to CT domain. We propose a 2D U-Net-based deep learning approach for pseudo-CT synthesis that incorporates an additional Grad-CAM guided attention mechanism for superior image translation of bone regions. The suggested architecture consists of image-to-image translation and image classification modules. We first train our classifier to distinguish between MR and CT images. After that, we utilize it in combination with the Grad-CAM technique to provide additional guidance to our image-to-image translation network. We generate CT-class-specific localization maps for both CT and pseudo-CT images and then compare them. Thus, we force the image-to-image translation network to focus on relevant attributes of the CT class, such as bone structures, while learning to synthesize pseudo-CTs. The performance of the proposed approach is evaluated on the publicly available RIRE data set. Since MR and CT images in this data set are not correctly aligned with each other, we also briefly describe the applied image registration procedure. The experimental results are compared to the baseline U-Net model and demonstrate both qualitative and quantitative improvements, whereas significant performance gain is achieved for bone regions. Clinical relevance — MRI-based pseudo-CT synthesis is essential for attenuation correction of PET in combined PET/MRI systems and plays a vital role in MRI-only radiotherapy planning. Accurate pseudo-CTs can prevent patients from harmful and unnecessary radiation exposure.
|
|
15:45-17:30, Paper WeEP-06.3 | |
A Comparative Study between Image and Projection-Domain Self-Supervised Denoising for Ultra Low-Dose CBCT |
|
Choi, Kihwan | Korea Institute of Science and Technology |
Keywords: Image reconstruction and enhancement - Machine learning / Deep learning approaches, Image enhancement - Denoising, CT imaging
Abstract: We compare image domain and projection domain denoising approaches with self-supervised learning for ultra low-dose cone-beam CT (CBCT), where number of detected x-ray photons is significantly low. For image-domain self-supervised denoising, we first reconstruct CBCT images with the standard filtered backprojection. For model training, we use blind-spot filtering to partially blind images and recover the blind spots. For projection-domain self-supervised denoising, we regard the post-log projections as training examples of convolutional neural network. From experimental results with various low-dose CBCT settings, the projection-domain denoiser outperforms the image-domain denoiser both in image quality and accuracy for ultra low-dose CBCT.
|
|
15:45-17:30, Paper WeEP-06.4 | |
Improving Fast MRI Reconstructions with Pretext Learning in Low-Data Regime |
|
JETHI, AMRIT | HTIC |
Souza, Roberto | University of Calgary |
Sirukarumbur Shanmugaram, Keerthi Ram | IIT Madras |
Sivaprakasam, Mohanasankar | Indian Institute of Technology Madras |
Keywords: Image reconstruction - Fast algorithms, Image reconstruction and enhancement - Machine learning / Deep learning approaches, Magnetic resonance imaging - MR neuroimaging
Abstract: Supervised deep learning methods have shown great promise for making magnetic resonance (MR) imaging scans faster. However, these supervised deep learning models need large volumes of labelled data to learn valuable representations and produce high-fidelity MR image reconstructions. The data used to train these models are often fully-sampled raw MR data, retrospectively under-sampled to simulate different MR acquisition acceleration factors. Obtaining high-quality, fully sampled raw MR data is costly and time-consuming. In this paper, we exploit the self-supervision based learning by introducing a pretext method to boost feature learning using the more commonly available under-sampled MR data. Our experiments using different deep-learning-based reconstruction models in a low data regime demonstrate that self-supervision ensures stable training and improves MR image reconstruction.
|
|
15:45-17:30, Paper WeEP-06.5 | |
Synthetic Generation of Cardiac MR Images Combining Convolutional Variational Autoencoders and Style Transfer |
|
Jaén-Lorites, José Manuel | Universitat Politècnica De València |
Pérez-Pelegrí, Manuel | Universitat Politècnica De València |
Laparra, Valero | Universitat De València |
López-Lereu, María P. | ERESA |
Monmeneu, José V. | ERESA |
Maceira, Alicia M. | ASCIRES Grupo Biomédico |
Moratal, David | Universitat Politècnica De València |
|
|
15:45-17:30, Paper WeEP-06.6 | |
Deep-Learning for High Quality and High Quantitative Ultrasonic Echo Imaging |
|
Li, Yiran | Sophia University |
Zhang, Mengfei | Sophia University |
Ogane, Gento | Dept of Info &Commun Sci, Sophia University |
Sumi, Chikayoshi | Sophia University |
Keywords: Image reconstruction and enhancement - Machine learning / Deep learning approaches, Machine learning / Deep learning approaches, Ultrasound imaging - Breast
Abstract: This paper performs in simulations deep learning (DL) for high quality and high quantitative ultrasonic (US) echo imaging: (i) reduction of multiple echoes (multiple reverberations) and (ii) grading lobe echoes, (iii) separation of multiply crossed waves in US echo images, (iv) US attenuation correction imaging and (v) superresolutioned reflection and scattering imaging. In addition, (vi) segmentations of benign and malignant (cancerous) tumors in breast tissues are also performed. This study about DL suggests the possibility of DL US segmentation for the automatic differential diagnosis about the human in vivo breast tumors in conjunction with the surrounding DL models.
|
|
15:45-17:30, Paper WeEP-06.7 | |
The Complex-Valued PD-Net for MRI Reconstruction of Knee Images |
|
Jain, Poornima | International Institute of Information Technology Bangalore |
Chakka, Sai Pradeep | International Institute of Information Technology Bangalore |
Sinha, Neelam | International Institute of Information Technology, Bangalore |
Keywords: Image reconstruction - Performance evaluation, Image reconstruction and enhancement - Machine learning / Deep learning approaches, Magnetic resonance imaging - Other organs
Abstract: MRI reconstruction is the fundamental task of obtaining diagnostic quality images from MRI sensor data and is an active area of research for improving accuracy, speed and memory requirements of the process. Complexvalued neural networks have previously achieved superior MRI reconstructions compared to real-valued nets. But those works operated in the image domain to denoise poor quality reconstructions of the raw sensor (k-space) data. Also smallscale or proprietary datasets with few clinical images or raw k-space volumes were used in these works, and none of the works use publicly available large-scale raw k-space datasets. Recent studies have shown that cross-domain neural networks for MRI reconstruction, or networks which leverage information from both k-space and image domains, have better potential than single-domain networks which operate only in one domain. We study the effects of complex-valued operations on a top-performing cross-domain neural network for MRI reconstruction called the Primal-Dual net, or PD-net. The PDnet is a fully convolutional architecture that takes input as raw k-space data and outputs the reconstructions, thus performing both the inversion and denoising tasks. We experiment with the publicly available, large-scale fastMRI single-coil knee dataset having 973 train volumes and 199 validation volumes. Our proposed method (Complex PD-net) achieves PSNR and SSIM of 33.3 dB and 0.8033 respectively, compared to 32.13 dB and 0.728 obtained by PD-net. Our Complex PD-net achieves 10.3% higher SSIM with just over 50% of the total parameters w.r.t. the SOTA methodology.
|
|
15:45-17:30, Paper WeEP-06.8 | |
Synthesizing Contrast-Enhanced Computed Tomography Images with an Improved Conditional Generative Adversarial Network |
|
Yang, Yulin | Ritsumeikan University |
Xu, Caie | Zhejiang University of Science and Technology |
Chen, Qingqing | Sir Run Shaw Hospital, Zhejiang University |
Iwamoto, Yutaro | Ritsumeikan University |
Han, Xianhua | Ritsumeikan University |
Tong, Ruofeng | Zhejiang University |
Lin, Lanfen | Zhejiang University |
Hu, Hongjie | Sir Run Run Shaw Hospital |
Chen, Yen-Wei | Ritsumeikan University |
Keywords: Image reconstruction and enhancement - Image synthesis, CT imaging, CT imaging applications
Abstract: Contrast-enhanced computed tomography (CE-CT) images are widely used for diagnosis of focal liver lesions. Compared with the non-contrast CT (NC CT) images, the CE-CT images are obtained after injecting the contrast, which will increase physical burden of patients. To handle the limitation, we proposed an improved conditional generative adversarial network (improved cGAN) to synthesize CE-CT images from non-contrast CT images (CT scans without injection). In the improved cGAN, we incorporate a pyramid pooling module and a feature fusion module to the generator to improve the capability of encoder in capturing global information and prevent the dilution of information in the process of decoding. To evaluate the performance of our proposed method, we conducted experiments on a contrast-enhanced CT dataset, which consists of four different types of focal liver lesions (i.e., focal nodular hyperplasia, hepatocellular carcinoma, metastasis and cyst), each case contains three phases of non-contrast image, CE-CT images in arterial and portal venous phases. Both quantitative and qualitative results indicate the proposed method is superior to existing GAN-based models.
|
|
15:45-17:30, Paper WeEP-06.9 | |
Cross-Modality Image Adaptation Based on Volumetric Intensity Gaussian Process Models (VIGPM) |
|
Nbonsou Tegang, Herve Nicolas | University of Cape Town |
Borotikar, Bhushan | University of Western Brittany |
Fouefack, Jean-Rassaire | University of Cape Twon |
Burdin, Valerie | IMT Atlantique/Institut Mines Telecom - INSERM U1101 |
Mutsvangwa, Tinashe Ernest Muzvidzwa | University of Cape Town |
Keywords: Image reconstruction and enhancement - Image synthesis, Image reconstruction and enhancement - Tomographic reconstruction, CT imaging
Abstract: Image-based diagnosis routinely depends on more that one image modality for exploiting the complementary information they provide. However, it is not always possible to obtain images from a secondary modality for several reasons such as cost, degree of invasiveness and non-availability of scanners. Three-dimensional (3D) morphable models have made a significant contribution to the field of medical imaging for feature-based analysis. Here we extend their use to encode 3D volumetric imaging modalities. Specifically, we build a Gaussian Process (GP) over transformations establishing anatomical correspondence between training images within a modality. Given, two different modalities, the GP’s eigenspace (latent space) can then be used to provide a parametric representation of each image modality, and we provide an operator for cross- domain translation between the two. We show that the latent space yields samples that are representative of the encoded modality. We also demonstrate that a 3D volumetric image can be efficiently encoded in latent space and transferred to synthesize the corresponding image in another modality. The framework called VIGPM can be extended by designing a fitting process to learn an observation in a given modality and performing cross-modality synthesis. Clinical relevance— The proposed method provides a way to access a multi modality image from one modality. Both the source and synthetic modalities are in anatomical correspon- dence, giving access to registered complementary information.
|
|
WeEP-07 |
Hall 5 |
Theme 02. Image Segmentation - P1 |
Poster Session |
Chair: Laine, Andrew F. | Columbia University |
|
15:45-17:30, Paper WeEP-07.1 | |
A Deep Graph Cut Model for 3D Brain Tumor Segmentation |
|
De, Arijit | Jadavpur University |
Tiwari, Mona | Institute of Neuroscience |
Grisan, Enrico | London South Bank University |
Chowdhury, Ananda | Jadavpur University |
Keywords: Image segmentation, Magnetic resonance imaging - MR neuroimaging, Machine learning / Deep learning approaches
Abstract: Brain tumor segmentation plays a key role in tumor diagnosis and surgical planning. In this paper, we propose a solution to the 3D brain tumor segmentation problem using deep learning and graph cut from the MRI data. In particular, the probability maps of a voxel to belong to the object (tumor) and background classes from the UNet are used to improve the energy function of the graph cut. We derive new expressions for the data term, the region term and the weight factor balancing the data term and the region term for individual voxels in our proposed model. We validate the performance of our model on the publicly available BRATS 2018 dataset. Our segmentation accuracy with a dice similarity score of 0.92 is found to be higher than that of the graph cut and the UNet applied in isolation as well as over a number of state of the art approaches.
|
|
15:45-17:30, Paper WeEP-07.2 | |
Automated Microsurgical Tool Segmentation and Characterization in Intra-Operative Neurosurgical Videos |
|
P, Pon Deepika | International Institute of Information Technology Bangalore |
Udupa, Karthik | International Institute of Information Technology Bangalore |
Beniwal, Manish | NIMHANS |
Mohan Uppar, Alok | NIMHANS |
Vazhiyal, Vikas | NIMHANS |
Rao, Madhav | IIITBangalore |
Keywords: Image segmentation, Machine learning / Deep learning approaches, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: Checklist based routine evaluation of surgical skills in any medical school demands quality time and effort from the supervising expert and is highly influenced by assessor bias. Alternatively, automated video based surgical skill assessment is a simple and viable method to analyze surgical dexterity offline without the need for acute presence of an expert surgeon throughout the surgery. In this paper, a novel approach and results for the automated segmentation of microsurgical instruments from the real-world neurosurgical video dataset was presented. The proposed tool segmentation model showcased mean average precision of 96.7% in detecting and localizing five surgical instruments from the real-world neurosurgical videos. Accurate detection and characterization of motion features of the microsurgical tool from the novel annotated neurosurgical video dataset forms the key step towards automated surgical skill evaluation. Clinical relevance— Tool segmentation, localization, and characterization in neurosurgical video, has several applications including assessing surgeons skills, training novice surgeons, understanding critical operating procedures post surgery, characterizing any critical anatomical response to the tool that leads to the success or failure of the surgery, and building models for conducting autonomous robotic surgery. Semantic segmentation, and characterization of the microsurgical tools forms the basis of the modern neurosurgery.
|
|
15:45-17:30, Paper WeEP-07.3 | |
Segmentation with Residual Attention U-Net and an Edge-Enhancement Approach Preserves Cell Shape Features |
|
Zhu, Nanyan | Columbia University |
Liu, Chen | Columbia University |
Forsyth, Britney | Columbia University |
Singer, Zakary | Columbia University |
Laine, Andrew F. | Columbia University |
Danino, Tal | Columbia University |
Guo, Jia | Columbia University |
Keywords: Image segmentation, Machine learning / Deep learning approaches
Abstract: The ability to extrapolate gene expression dynamics in living single cells requires robust cell segmentation, and one of the challenges is the amorphous or irregularly shaped cell boundaries. To address this issue, we modified the U-Net architecture to segment cells in fluorescence widefield microscopy images and quantitatively evaluated its performance. We also proposed a novel loss function approach that emphasizes the segmentation accuracy on cell boundaries and encourages shape feature preservation. With a 97% sensitivity, 93% specificity, 91% Jaccard similarity, and 95% Dice coefficient, our proposed method, called Residual Attention U-Net with edge-enhancement, surpassed the state-of-the-art U-Net in segmentation performance as evaluated by traditional metrics. The proposed model also performed best for preserving cell shape features, namely area, eccentricity, major axis length, solidity and orientation. These improvements can serve as useful assets for downstream cell tracking and quantification of changes in cell statistics or features over time.
|
|
15:45-17:30, Paper WeEP-07.4 | |
Improving the Segmentation of Pediatric Low-Grade Gliomas through Multitask Learning |
|
Vafaeikia, Partoo | University of Toronto |
Wagner, Matthias W. | The Hospital for Sick Children |
Hawkins, Cynthia | The Hospital for Sick Children |
Tabori, Uri | The Hospital for Sick Children |
Ertl-Wagner, Birgit B. | The Hospital for Sick Children |
Khalvati, Farzad | University of Toronto |
Keywords: Image segmentation, Machine learning / Deep learning approaches, Brain imaging and image analysis
Abstract: Brain tumor segmentation is a critical task for tumor volumetric analyses and AI algorithms. However, it is a time-consuming process and requires neuroradiology expertise. While there has been extensive research focused on optimizing brain tumor segmentation in the adult population, studies on AI guided pediatric tumor segmentation are scarce. Furthermore, MRI signal characteristics of pediatric and adult brain tumors differ, necessitating the development of segmentation algorithms specifically designed for pediatric brain tumors. We developed a segmentation model trained on magnetic resonance imaging (MRI) of pediatric patients with low-grade gliomas (pLGGs) from The Hospital for Sick Children (Toronto, Ontario, Canada). The proposed model utilizes deep Multitask Learning (dMTL) by adding tumor's genetic alteration classifier as an auxiliary task to the main network, ultimately improving the accuracy of the segmentation results.
|
|
15:45-17:30, Paper WeEP-07.5 | |
ITUnet: Integration of Transformers and Unet for Organs-At-Risk Segmentation |
|
Kan, Hongyu | University of Science and Technology of China |
Shi, Jun | University of Science and Technology of China |
Zhao, Minfan | University of Science and Technology of China |
Wang, Zhaohui | University of Science and Technology of China |
Han, Wenting | USTC |
An, Hong | USTC |
Wang, Zhaoyang | University of Birmingham |
Wang, Shuo | School of Computer Science, University of Birmingham |
Keywords: Image segmentation, Machine learning / Deep learning approaches, CT imaging applications
Abstract: Recently, convolutional neural network(CNN) has achieved great success in medical image segmentation. However, due to the limitation of convolutional receptive field, the pure convolutional neural network is difficult to further improve its performance. Given the outstanding ability of transformers in extracting the long-range dependency, some works have successfully applied it to computer vision and achieved better results than CNN in some tasks. Based on transformers could remedy the shortage of CNN, in this paper, we propose ITUnet, a segmentation network using CNN and transformers as features extractor. The combination of CNN and transformers enables the network to learn both short- and long-range dependency of features, which is beneficial to segmentation tasks. We evaluate our method on a head-and-neck CT dataset which has 18 kinds of organs to be segmented. The experimental results demonstrate that our proposed method shows better accuracy and robustness, the proposed methods achieve the Dice score of 77.72 and the 95% Hausdorff Distance of 2.31, outperforming the existing methods.
|
|
15:45-17:30, Paper WeEP-07.6 | |
Learning to Segment Fine Structures under Image-Level Supervision with an Application to Nematode Segmentation |
|
Chen, Long | RWTH Aachen University, Aachen, Germany |
Strauch, Martin | RWTH Aachen University |
Daub, Matthias | Julius Kühn Institute: Federal Research Centre for Cultivated Pl |
Luigs, Hans-Georg | LemnaTec GmbH, Aachen, Germany |
Jansen, Marcus | LemnaTec GmbH, Aachen, Germany |
Merhof, Dorit | RWTH Aachen University |
Keywords: Image segmentation, Machine learning / Deep learning approaches, Optical imaging and microscopy - Microscopy
Abstract: Image segmentation models trained only with image-level labels have become increasingly popular as they require significantly less annotation effort than models trained with scribble, bounding box or pixel-wise annotations. While methods utilizing image-level labels achieve promising performance for the segmentation of larger-scale objects, they perform less well for the fine structures frequently encountered in biological images. In order to address this performance gap, we propose a deep network architecture based on two key principles, Global Weighted Pooling (GWP) and segmentation refinement by low-level image cues, that, together, make segmentation of fine structures possible. We apply our segmentation method to image datasets containing such fine structures, nematodes (worms + eggs) and nematode cysts immersed in organic debris objects, which is an application scenario encountered in automated soil sample screening. Supervised only with image-level labels, our approach achieves Dice coefficients of 79.72% and 58.51 % for nematode and nematode cyst segmentation, respectively.
|
|
15:45-17:30, Paper WeEP-07.7 | |
Residual Channel Attention Network for Brain Glioma Segmentation |
|
Yao, Yiming | Shanghai University |
Qian, Peisheng | Institute for Infocomm Research (I2R), Agency for Science |
Zhao, Ziyuan | Institute for Infocomm Research (I2R), Agency for Science, Techn |
Zeng, Zeng | Institute for Infocomm Research (I2R), Agency for Science, Techn |
Keywords: Image segmentation, Magnetic resonance imaging - Other organs, Brain imaging and image analysis
Abstract: A glioma is a malignant brain tumor that seriously affects cognitive functions and lowers patients’ life quality. Segmentation of brain glioma is challenging because of interclass ambiguities in tumor regions. Recently, deep learning approaches have achieved outstanding performance in the automatic segmentation of brain glioma. However, existing algorithms fail to exploit channel-wise feature interdependence to select semantic attributes for glioma segmentation. In this study, we implement a novel deep neural network that integrates residual channel attention modules to calibrate intermediate features for glioma segmentation. The proposed channel attention mechanism adaptively weights feature channel-wise to optimize the latent representation of gliomas. We evaluate our method on the established dataset BraTS2017. Experimental results indicate the superiority of our method.
|
|
15:45-17:30, Paper WeEP-07.8 | |
Spine Segmentation with Multi-View GCN and Boundary Constraint |
|
wang, dexu | Shanghai Jiao Tong University |
Yang, Zhikai | Shanghai Jiao Tong University |
Huang, Ziyan | Shanghai Jiao Tong University |
Gu, Lixu | Shanghai Jiaotong University |
Keywords: Image segmentation, Machine learning / Deep learning approaches, Magnetic resonance imaging - Other organs
Abstract: Multi-class segmentation of vertebrae and intervertebral discs (IVDs) is crucial for the diagnosis and treatment of spinal diseases. However, it is still a challenge due to similarities between neighboring vertebrae of a subject and differences among the IVDs from different subjects. In this paper, we propose a novel spine segmentation framework to achieve automatic segmentation of vertebrae and IVDs in MR images. The core component of the new framework is a Multi-View GCN (MVGCN), which utilizes multi-view features and graph convolutional network (GCN) to reason about the relations of vertebrae and IVDs. We additionally use a boundary constraint for better segmentation of the boundary between vertebrae and IVDs. We test our method on a public spine dataset of 172 MR volumetric images for the vertebrae and IVDs segmentation. The experimental results demonstrate the efficacy of our method. Code and models of our method will be available in the future.
|
|
15:45-17:30, Paper WeEP-07.9 | |
A Fast and Memory-Efficient Brain MRI Segmentation Framework for Clinical Applications |
|
Nejad, Ashkan | University of Groningen |
Masoudnia, Saeed | Tehran University of Medical Sciences |
Nazem-Zadeh, Mohammad-Reza | Tehran University of Medical Sciences |
Keywords: Image segmentation, Brain imaging and image analysis, Magnetic resonance imaging - MR neuroimaging
Abstract: Current segmentation tools of brain MRI provide quantitative structural information for diagnosing neurological disorders. However, their clinical application is generally limited due to high memory and time consumption. Although 3D CNN-based segmentation methods have recently achieved the state-of-the-art and come up with timely available results, they heavily require high memory GPUs. This paper customizes a memory-efficient (GPU) brain structure segmentation framework based on nnU-nets, enabling our framework to adapt its architecture based on memory constraints dynamically. To further reduce the need for memory, we also reduce multi-label brain segmentation to the fusion of single label segmentation. Single label patches are extracted from the T1w and segmentation maps by locating the approximate area of each structure on the MNI305 template, including the safety margin. These considerations not only decrease the hardware usage but also maintains comparable computational time. Moreover, the target brain structures are customizable based on the specific clinical application. We evaluate the performance in terms of dice coefficient, runtime, and GPU requirement on OASIS-3 and CoRR-BNU1 datasets. The validation results show comparable accuracy with similar methods and confirm generalizability on unseen datasets while significantly reducing GPU requirements and maintaining runtime duration. Our framework is runnable on a budget GPU with a minimum requirement of 4G RAM. We develop a memory-efficient deep Brain MRI segmentation tool that significantly reduces the hardware requirement of MRI segmentation while maintaining comparable accuracy and time. These advantages make our framework suitable for widespread use in clinical applications, especially for clinics with a limited budget. We plan to release the framework as a part of a free clinical brain imaging analysis tool. The code for this framework is publicly available.
|
|
WeEP-08 |
Hall 5 |
Theme 02. Machine Learning/Deep Learning Applications - P1 |
Poster Session |
|
15:45-17:30, Paper WeEP-08.1 | |
A Novel Multi-View Deep Learning Approach for BI-RADS and Density Assessment of Mammograms |
|
Nguyen, Thi Xuan Huyen | VinBigdata |
Tran, Bao Sam | VinBigdata |
Nguyen, Ba Dung | VinBigdata |
Pham, Huy Hieu | VinUni-Illinois Smart Health Center |
Nguyen, Quy Ha | VinBigdata |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, X-ray radiography, Image classification
Abstract: Advanced deep learning (DL) algorithms may predict the patient’s risk of developing breast cancer based on the Breast Imaging Reporting and Data System (BI-RADS) and density standards. Recent studies have suggested that the combination of multi-view analysis improved the overall breast exam classification. In this paper, we propose a novel multi-view DL approach for BI-RADS and density assessment of mammograms. The proposed approach first deploys deep convolutional networks for feature extraction on each view separately. The extracted features are then stacked and fed into a Light Gradient Boosting Machine (LightGBM) classifier to predict BI-RADS and density scores. We conduct extensive experiments on both the internal mammography dataset and the public dataset Digital Database for Screening Mammography (DDSM). The experimental results demonstrate that the proposed approach outperforms the single-view classification approach on two benchmark datasets by huge F1-score margins (+5% on the internal dataset and +10% on the DDSM dataset). These results highlight the vital role of combining multi-view information to improve the performance of breast cancer risk prediction.
|
|
15:45-17:30, Paper WeEP-08.2 | |
Orthographic Pooling: Learned Maximum Intensity Projection for Vertebrae Labelling |
|
Cai, Bin | Hefei Institute of Physical Science, Chinese Academy of Sciences |
Guo, Yuhao | Anhui University |
Liang, Pengpeng | Zhengzhou University |
Wang, Kaifeng | Peking University People's Hospital |
Sun, Zhiyong | Institute of Intelligent Machines, Hefei Institute of Physical S |
xiong, chi | The First Affiliated Hospital of USTC, Division of Life Sciences |
Song, Bo | Hefei Institutes of Physical Science, Chinese Academy of Science |
niu, chaoshi | The First Affiliated Hospital of USTC, Division of Life Sciences |
Cheng, Erkang | Institute of Intelligent Machines |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Machine learning / Deep learning approaches, CT imaging
Abstract: Maximum intensity projection (MIP) is a standard volume-rendering technique for 3D volumetric data processing. For example, given a 3D CT data, it simply projects the voxel values with its maximum intensity on a specific view to output a 2D image. Recently, MIP is further combined with Btrfly Net for vertebrae labelling task. However, this simple reformations of 3D data leads to loss of rich context information in volumetric data. In this paper, we propose a learned orthographic pooling approach instead of image processing based MIP. Typically, a simple conv-simple and bottleneck pooling modules are introduced to learn the orthographic projection of 3D data and output 2D intermediate feature maps. To this end, the learned orthographic pooling helps preserve detail information of 3D context during projection. Furthermore, an unified Btrfly Net is provided for vertebrae labelling by integrating the orthographic pooling sub-network. The novel Btrfly Net with orthographic pooling sub-network is evaluated on the 2014 MICCAI vertebra localization challenge dataset. Compared to original Butfly Net with MIP, orthographic pooling, the learned MIP largely boosts the performance of vertebrae labelling.
|
|
15:45-17:30, Paper WeEP-08.3 | |
A Multi-Scale Attention-Based Convolutional Network for Identification of Alzheimer's Disease Based on Hippocampal Subfields |
|
Xu, Hongbo | University of Chinese Academy of Sciences |
Liu, Yan | University of Chinese Academy of Sciences |
zeng, xiangzhu | Peking University Third Hospital, Beijing, China |
Wang, Ling | University of Electronic Science and Technology of China |
Wang, Zheng | Capital University of Medical Sciences |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Brain imaging and image analysis, Multiscale image analysis
Abstract: Hippocampus is an important anatomical region for Alzheimer’s Disease (AD) identification. In this paper, a multi-scale attention-based convolutional network is proposed for AD identification. The two dimensional (2D) images in three different planes of hippocampal subfields are used as input of three branches of the proposed network, which achieves effective extraction of three dimensional (3D) data features while reducing the network complexity and improving the computational efficiency. The end-to-end 2D multi-scale attention-based deep learning network improves the diversity of the extracted features and captures significance of various voxels for classification, which achieves significant classification performance without handcrafted feature extraction and model stacking. Experimental results illustrate the effectiveness of the proposed method on AD identification. The proposed method will be useful for further medical analysis on hippocampal subfields of the brain for diagnosis of neurodegenerative disease.
|
|
15:45-17:30, Paper WeEP-08.4 | |
An Unsupervised Region of Interest Extraction Model for Tau PET Images and Its Application in the Diagnosis of Alzheimer's Disease |
|
Shi, Rong | Shanghai University |
Wang, Luyao | Shanghai University |
jiang, jiehui | Shanghai University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, PET and SPECT Imaging applications, Brain imaging and image analysis
Abstract: Background: Recently, tau positron-emission tomography (PET) images have been widely used for the diagnosis of Alzheimer's disease (AD). However, existing semi-quantitative uptake value ratios (SUVR) calculation is usually based on group analysis or specific brain regions from existing templates, which cannot detect individual heterogeneity. In this study, we proposed a novel deep learning model; called generative adversarial networks constrained multiple loss autoencoder for tau (GANCMLAE4TAU), to extract individual regions of interest (ROIs) of tau deposition. Methods: The basic framework of the proposed model is composed of two encoders, one decoder, and one discriminator. Tau PET images of 327 cognitive normal (CN) subjects from Alzheimer’s Disease Neuroimaging Initiative (ADNI) were used to train the model and 29 CNs from Huashan Hospital were used as an external validation group. The other 57 AD patients and 83 CNs subjects from ADNI were used in the classification task. The Structural Similarity (SSIM), Peak Signal to Noise Ratio (PSNR) and Mean Square Error (MSE) were applied to validate the robustness of our model. In addition, we conducted a receiver operating characteristic curve (ROC) analysis for the SUVR of individual ROIs from the GANCMLAE4TAU model and compared it with SUVR of the whole brain and ROIs from the templates. Results: Our model achieved good SSIM (0.963±0.006), PSNR (35.960±3.458) and MSE (0.0004±0.0003). In ROC analysis, our model had the highest area under curve (AUC) (0.869, 0.809-0.929) in discriminating AD from CN subjects. Conclusion: GANCMLAE4TAU could detect individual ROIs for tau PET images and had the potential to be developed as a novel diagnostic tool in the future.
|
|
15:45-17:30, Paper WeEP-08.5 | |
Automatic Diagnosis of Early-Stage Oral Cancer and Precancerous Lesions from ALA-PDD Images Using GAN and CNN |
|
Fujimoto, Taro | Waseda University |
Fukuzawa, Eiji | Yazaki Corporation, Waseda University |
Tatehara, Seiko | Department of Oral Medicine and Stomatology, School of Dental Me |
Satomura, Kazuhito | Department of Oral Medicine and Stomatology, School of Medicine, |
Ohya, Jun | Waseda University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches
Abstract: A screening system for early-stage oral cancer and precancerous lesions should be established because it is difficult to detect them even for specialists and they are often detected too late. In this paper, we propose a method for automatically classifying fluorescence images acquired by ALA-PDD (Photodynamic Diagnosis using 5-Aminolevulinic Acid) into three classes: Normal, Low-Risk, High-Risk. We augment a small image dataset by training GAN (Generative adversarial networks) with Differentiable Augmentation, and then train CNN (Convolutional Neural Network) for the classification by the augmented dataset. Experimental results show good classification results, which suggest that the combination of ALA-PDD and CNN classification is a promising method for oral cancer screening.
|
|
15:45-17:30, Paper WeEP-08.6 | |
Deep Learning Approach for Classifying Bacteria Types Using Morphology of Bacterial Colony |
|
Amano, Masaki | The University of Electro-Communications |
Mai Duc, Tho | The University of Electro-Communications |
Ishibashi, Koichiro | The University of Electro-Communications |
Sun, Guanghao | The University of Electro-Communications |
Nguyen Vu, Trung | Department of Microbiology, Hanoi Medical University |
Le Thi, Hoi | Department of Microbiology and Parasitology, Hanoi Medical Unive |
Nguyen Thi, Hoa | The Department of Microbiology and National Tuberculosis Referen |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification
Abstract: The significant bottlenecks in determining bacterial species are much more time-consuming and the biology specialist’s long-term experience requirements. Specifically, it takes more than half a day to cultivate a bacterium, and then a skilled microbiologist and a costly specialized machine are utilized to analyze the genes and classify the bacterium according to its nucleotide sequence. To overcome these issues as well as get higher recognition accuracy, we proposed applying convolutional neural networks (CNNs) architectures to automatically classify bacterial species based on some key characteristics of bacterial colonies. Our experiment confirmed that the classification of three bacterial colonies could be performed with the highest accuracy (97.19%) using a training set of 5000 augmented images derived from the 40 original photos taken in the Hanoi Medical University laboratory in Vietnam.
|
|
15:45-17:30, Paper WeEP-08.7 | |
ADAN: An Adversarial Domain Adaptation Neural Network for Early Gastric Cancer Prediction |
|
Jie, Luyang | Shanghai University |
liang, pengchen | Shanghai University |
Zhao, Ziyuan | Institute for Infocomm Research (I2R), Agency for Science, Techn |
Chen, Jianguo | Agency for Science, Technology and Research |
Chang, qing | Jiading District Central Hospital Affiliated Shanghai University |
Zeng, Zeng | Institute for Infocomm Research (I2R), Agency for Science, Techn |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification, Image feature extraction
Abstract: Gastric cancer is a highly prevalent cancer worldwide. Accurate diagnosis of Early Gastric Cancer (EGC) is of great significance to improve the treatment and survival rate of patients. However, EGC and gastric ulcers have similar endoscopic image characteristics, resulting in a high misdiagnosis rate. Most existing deep learning and machine learning models for EGC recognition have the disadvantages of cumbersome pre-processing steps and high leakage ratios. To address the above challenges, we propose an end-to-end Adversarial Domain Adaptation Neural network (ADAN) for EGC prediction on endoscopic images. ADAN network consists of a source domain feature extractor, a source domain classifier, two target domain feature extractors, a target domain classifier, and a domain discriminator. A source domain feature extractor is designed to train the model on public gastrointestinal datasets, which effectively solves the problem of insufficient training data. In addition, an adaptive source-target domain mapping classifier is added to each target domain feature extractor for automatically adjusting the number of classification categories in the target domain. Experimental results show that the proposed ADAN network is superior to the most advanced methods and can accurately predict EGC in clinical practice.
|
|
15:45-17:30, Paper WeEP-08.8 | |
Analysis of Advanced Siamese Neural Networks for Motion Tracking of Sonography of Carotid Arteries |
|
Wasih, Mohammad | The Pennsylvania State University, University Park, Pennsylvania |
Almekkawy, Mohamed | Penn State University |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Machine learning / Deep learning approaches, Ultrasound imaging - Elastography
Abstract: The Siamese Tracker (ST) for tracking objects of interest in Ultrasound (US) images does not incorporate video-specific cues and assumes a fixed template of the reference block. Recently, a more advanced version of ST, Correlation Filter Network (CFNet), which overcomes the problems of ST, has been used for tracking in US images. In this study, we demonstrate how the basic CFNet can be made computationally more efficient by reducing the number of layers in its feature extraction network. We further show that due to the unique architecture of the CFNet, this strategy does not affect the performance of the baseline CFNet considerably. Our methodology was evaluated on 10 random sequences from the publicly available carotid artery dataset. CFNet obtained a 35.7% improvement in the average localization error over the basic ST, thus demonstrating that it is a practical and robust tracking algorithm for tracking objects in US images.
|
|
15:45-17:30, Paper WeEP-08.9 | |
Analysis of Classification Tradeoff in Deep Learning for Gastric Cancer Detection |
|
Lima, Gabriel | Faculty of Science, University of Porto |
Coimbra, Miguel | INESC TEC / Universidade Do Porto |
Dinis-Ribeiro, Mário | Instituto Português De Oncologia - Porto |
Libânio, Diogo | CIDES/CINTEIS, Faculty of Medicine, University of Porto |
Renna, Francesco | INESC TEC E Faculdade De Ciências Da Universidade Do Porto |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image classification, Image feature extraction
Abstract: This study aimed to build convolutional neural network (CNN) models capable of classifying upper endoscopy images, to determine the stage of infection in the development of a gastric cancer. Two different problems were covered. A first one with a smaller number of categorical classes and a lower degree of detail. A second one, consisting of a larger number of classes, corresponding to each stage of precancerous conditions in the Correa’s cascade. Three public datasets were used to build the dataset that served as input for the classification tasks. The CNN models built for this study are capable of identifying the stage of precancerous conditions/lesions in the moment of an upper endoscopy. A model based on the DenseNet169 architecture achieved an average accuracy of 0.72 in discriminating among the different stages of infection. The trade-off between detail in the definition of lesion classes and classification performance has been explored. Results from the application of Grad CAMs to the trained models show that the proposed CNN architectures base their classification output on the extraction of physiologically relevant image features.
|
|
15:45-17:30, Paper WeEP-08.10 | |
Collaborative Deep Learning for Privacy Preserving Diabetic Retinopathy Detection |
|
Karakaya, Mahmut | Kennesaw State University |
Aygun, Ramazan | Kennesaw State University |
Sallam, Ahmed | University of Arkansas for Medical Sciences |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Image feature extraction, Image classification
Abstract: Convolutional Neural Networks (CNNs) are an emerging research area for detection of Diabetic Retinopathy (DR) development in fundus images with highly reliable results. However, its accuracy depends on the availability of big datasets to train such a deep network. Due to the privacy concerns, the strict rules on medical data limit accessibility of images in publicly available datasets. In this paper, we propose a collaborative learning approach to train CNN models with multiple datasets while preserving the privacy of datasets in a distributed learning environment without sharing them. First, CNN networks are trained with private datasets, and tested with the same publicly available images. Based on their initial accuracies, the CNN model with the lowest performance among datasets is forwarded to second lowest performed dataset to retrain it using the transfer learning approach. Then, the retrained network is forwarded to next dataset. This procedure is repeated for each dataset from the lowest performed dataset to the highest. With this ascending chain order fashion, the network is retrained again and again using different datasets and its performance is improved over the time. Based on our experimental results on five different retina image datasets, DR detection accuracy is increased to 93.5% compared with the accuracies of merged datasets (84%) and individual datasets (73%, 78%, 83%, 85%).
|
|
15:45-17:30, Paper WeEP-08.11 | |
Fundus GAN - GAN-Based Fundus Image Synthesis for Training Retinal Image Classifiers |
|
Shenkut, Dereje Chinkil | Carnegie Mellon University |
Bhagavatula, Vijayakumar | Department of Electrical and Computer Engineering, CarnegieMello |
Keywords: Image analysis and classification - Machine learning / Deep learning approaches, Machine learning / Deep learning approaches, Image analysis and classification - Digital Pathology
Abstract: Two major challenges in applying deep learning to develop a computer-aided diagnosis of fundus images are the lack of enough labeled data and legal issues concerning patient privacy. Various efforts are being made to increase the amount of data either by augmenting training images or by synthesizing realistic-looking fundus images. However, augmentation is limited by the amount of available data and it does not address the patient privacy concern. In this paper, we propose a Generative Adversarial Network-based (GAN-based) fundus image synthesis method (Fundus GAN) that generates synthetic training images to solve the above problems. Fundus GAN is an improved way of generating retinal images by following a two-step generation process which involves first training a segmentation network to extract the vessel tree followed by vessel tree to fundus image-to-image translation using unsupervised generative attention networks. Our results show that the proposed Fundus GAN outperforms state of the art methods in different evaluation metrics. Our results also validate that generated retinal images can be used to train retinal image classifiers for eye diseases diagnosis.
|
|
WeEP-09 |
Hall 5 |
Theme 02. Other Imaging Applications - P2 |
Poster Session |
Chair: Flotho, Philipp | Saarland University Faculty of Medicine |
|
15:45-17:30, Paper WeEP-09.1 | |
Adaptive Micro-Liter Fiducials for Pre-Clinical MPI and MRI Imaging |
|
Bär, Sébastien | University of Freiburg |
Buchholz, Oliver | Universitätsklinikum Freiburg |
von Elverfeldt, Dominik | University of Freiburg |
Hofmann, Ulrich G. | University of Freiburg |
Keywords: Multimodal imaging, Multimodal image fusion, Image segmentation
Abstract: Magnetic Particle imaging (MPI) allows to measure and quantify background-free tracer distribution with a high temporal resolution. Anatomical structures are not displayed in MPI, rendering orientation within a sample error-prone and necessitating co-registration with other imaging modalities such as MRI. To support this challenge, defined external landmarks (fiducials) made from materials visible in each of the imaging modalities respectively were used in this work. Resulting signals can be aligned with the merged image containing both anatomical data and information about the tracer distribution. Defining the optimal fiducial placement is demanding and can drastically impact the 3D MPI-MRI image presentation. Here we present an adaptable 3D-printed fiducial system for preclinical co-registration of MRI and MPI data designed for easy visualisation. Clinical relevance — MPI is a promising imaging modality with many conceivable clinical applications. Simple and reliable co-registration with other imaging modalities will be crucial for a seamless transition into the clinic.
|
|
15:45-17:30, Paper WeEP-09.2 | |
Unity Human Eye Model for Gaze Tracking with a Query-Driven Dynamic Vision Sensor |
|
Tang, Serena | University of California San Diego |
Wang, Keli | University of California, San Diego |
Ogrey, Stephanie | University of California San Diego |
Villazon, Jorge | University of California, San Diego |
Khan, Sana | University of California San Diego |
Paul, Akshay | University of California San Diego |
Ardolino, Nolan | University of California San Diego |
Kubendran, Rajkumar | University of Pittsburgh |
Cauwenberghs, Gert | University of California San Diego |
Keywords: Novel imaging modalities, Image visualization, Machine learning / Deep learning approaches
Abstract: Objective measurement of gaze pattern and eye movement during untethered activity has important applications for neuroscience research and neurological disease detection. Current commercial eye-tracking tools rely on desktop devices with infrared emitters and conventional frame-based cameras. Although wearable options do exist, the large power-consumption from their conventional cameras limit true long-term mobile usage. The query-driven Dynamic Vision Sensor (qDVS) is a neuromorphic camera which dramatically reduces power consumption by outputting only intensity-change threshold events, as opposed to full frames of intensity data. However, such hardware has not yet been implemented for on-body eye-tracking, but the feasibility can be demonstrated using a mathematical simulator to evaluate the eye-tracking capabilities of the qDVS under controlled conditions. Specifically, a framework utilizing a realistic human eye model in the 3D graphics engine, Unity, is presented to enable the controlled and direct comparison of image-based gaze tracking methods. Eye-tracking based on qDVS frames was compared against two different conventional frame eye-tracking methods - the traditional ellipse pupil-fitting algorithm and a deep learning neural network inference model. Gaze accuracy from qDVS frames achieved an average of 93.2% for movement along the primary horizontal axis (pitch angle) and 93.1% for movement along the primary vertical axis (yaw angle) under 4 different illumination conditions, demonstrating the feasibility for using qDVS hardware cameras for such applications. The quantitative framework for the direct comparison of eye tracking algorithms presented here is made open-source and can be extended to include other eye parameters, such as pupil dilation, reflection, motion artifact, and more.
|
|
15:45-17:30, Paper WeEP-09.3 | |
Template-Based Balloon-Marker and Guidewire Detection for Coronary Stents in Cardiac Fluoroscopy |
|
Kotb, Ahmed | Cairo University |
Mahmoud, Ahmed M. | Cairo University Faculty of Engineering |
Rushdi, Muhammad | Cairo University |
Keywords: X-ray - Interventional radiology, Cardiac imaging and image analysis, Machine learning / Deep learning approaches
Abstract: The placement and visualization of coronary stents during fluoroscopy depends mainly on the detection of balloon markers and their connecting guidewires. In this paper, a novel template-based approach is proposed to detect balloon markers and guidewires in cardiac fluoroscopic images. In particular, guidewires are detected based on balloon markers only, without prior knowledge of the background or guidewire elements. Also, while earlier techniques used circular models of balloon markers, we propose a more realistic elliptical model. Training and the testing datasets for balloon marker and guidewire detection were collected from different Cathlab systems and annotated by an application specialist with 10 years of experience in this field. The balloon-marker detector achieved a precision of 98.5%. Within 3-pixel tolerance, the guidewire detector achieved a matching percentage of 99.5% with the true guidewire using a customized evaluation method. Moreover, the guidewire detector achieved a mean Hausdorff distance of 3.3 pixels (0.6 mm) and a longest-common-substring (LCS) distance with a mean matching percentage of 87% within 1-pixel tolerance.
|
|
15:45-17:30, Paper WeEP-09.4 | |
Lung Segmentation Reconstruction Based Data Augmentation Approach for Abnormal Chest X-Ray Images Diagnosis |
|
Wang, Zhendong | Beihang University |
Zhang, Xiaofeng | Beihang University |
Chen, Wei | Beihang University |
Niu, Jianwei | Beihang University |
Keywords: X-ray radiography, Image analysis and classification - Machine learning / Deep learning approaches
Abstract: Experienced radiologists can accurately diagnose relevant diseases by observing the cardiopulmonary region in chest X-ray images. Advances in deep learning techniques enable the prediction of lesions in chest X-ray images. However, deep learning-based algorithms usually require a large amount of training data, and it lacks an effective method for data generation and augmentation. In this paper, we propose a Lung Segmentation Reconstruction (LSR) module. A healthy chest X-ray image is generated based on the abnormal image as a reference. With the generated healthy reference, we propose a novel way of data augmentation for chest X-ray images. The whole images, lung regions and abnormal regions are stacked together and fed into a chest classification model to make a credible diagnosis. Extensive experiments have been conducted on the public dataset CheXpert. Experimental results show that our proposed abnormality enhancement can help the baseline models achieve better performance on consolidation and pleural effusion. These results highlight the potential value of the large number of healthy chest X-ray images in the dataset and the combination of different regions of chest X-ray images for prediction.
|
|
15:45-17:30, Paper WeEP-09.5 | |
A Software to Visualize, Edit, Model and Mesh Vascular Networks |
|
Decroocq, Méghane | CREATIS, Université Lyon1, CNRS UMR5220, INSERM U1206, INSA-Lyon |
Lavoué, Guillaume | LIRIS, CNRS UMR 5205, Ecole Centrale De Lyon |
Ohta, Makoto | University of Tohoku |
Frindel, Carole | Umr Cnrs 5220 - Inserm U630 |
Keywords: Image visualization, Magnetic resonance imaging - MR angiographic imaging
Abstract: Computational fluid dynamics (CFD) is a key tool for a wide range of research areas, beyond the computer science community. In particular, CFD is used in medicine to measure blood flow from patient specific models of arteries. In this field, the creation of accurate meshes remains the most challenging step, as it is based on the segmentation of medical images, a time-consuming task which often requires manual intervention by medical doctors. In this context, user-friendly, interactive softwares are valuable. They enable to spread the new advances in numerical treatment to the medical community and enrich them with the expert knowledge (e.g anatomical knowledge) of clinicians. In this work, we present a user interface dedicated to the meshing of vascular networks from centerlines. It allows for the 3D visualization and edition of input centerlines, which constitute a simplified, easy-to-manipulate representation of vascular networks. The surface of the artery can be reconstructed from the modified centerlines by an editable parametric model and then meshed with high quality hexahedral elements. At every step of the process, the network can be confronted with medical images with enhanced visualization. The software will be released publicly.
|
|
15:45-17:30, Paper WeEP-09.6 | |
Lagrangian Motion Magnification with Landmark-Prior and Sparse PCA for Facial Microexpressions and Micromovements |
|
Flotho, Philipp | Saarland University Faculty of Medicine |
Heiß, Cosmas | Institute of Mathematics, TU Berlin, Straße Des 17. Juni 136, 10 |
Steidl, Gabriele | Institute of Mathematics, TU Berlin, Straße Des 17. Juni 136, 10 |
Strauss, Daniel J. | Saarland University, Medical Faculty |
Keywords: Image visualization, Regularized image Reconstruction, Image reconstruction and enhancement - Image synthesis
Abstract: Video motion magnification methods are motion visualization techniques that aim to magnify subtle and imperceptibly small motions in videos. They fall into two main groups where Eulerian methods work on the pixel grid with implicit motion information and Lagrangian methods use explicitly estimated motion and modify point trajectories. The motion in high framerate videos of faces can contain a wide variety of information that ranges from microexpressions over pulse or respiratory rate to cues on speech and affective state. In his work, we propose a novel strategy for Lagrangian motion magnification that integrates landmark information from the face as well as an approach to decompose facial motions in an unsupervised manner using sparse PCA. We decompose the estimated displacements into different movement components that are subsequently amplified selectively. We propose two approaches: A landmark-based decomposition into global and local movements and a decomposition into multiple coherent motion components based on sparse PCA. Optical flow estimation is performed using a state-of-the-art deep learningbased method that we retrain on a microexpression database. Clinical relevance— This method could be applied to the annotation and analysis of micromovements for neurocognitive assessment and even novel, medical applications where micromotions of the face might play a role
|
|
15:45-17:30, Paper WeEP-09.7 | |
DVS-Net: Dual-Domain Variable Splitting Network for Accelerated Parallel MRI Data |
|
Ding, rui | University |
Bartlett, Joseph | University of Birmingham |
Duan, Jinming | University of Birmingham |
Duan, Yuping | Tianjin University |
Keywords: Magnetic resonance imaging - Parallel MRI, Regularized image Reconstruction
Abstract: Parallel imaging is an important method to accelerate the acquisition of magnetic resonance imaging data, which can shorten the breath-hold times and reduce motion artifacts. In this paper, we propose a joint frequency domain and image domain (dual-domain) reconstruction method by introducing the full sampling condition for the undersampled multi-coil MR data. The motivation is that the dual domain method can provide more information for accurate image reconstruction. An efficient iterative algorithm is developed based on the variable splitting technique and alternating direction method of multipliers, which is unrolled into an end-to-end trainable deep neural network. We evaluate the proposed network on complex valued multi-coil knee images for both 6-fold and 8-fold acceleration factors, and compare with both variational and deep learning based reconstruction algorithms. The numerical results demonstrate that our method provides better reconstruction accuracy and perceptual quality by making using of the dual domain information.
|
|
15:45-17:30, Paper WeEP-09.8 | |
Gastric Pacing Response Evaluated with Simultaneous Electrical and Optical Mapping |
|
Nagahawatte, Nipuni | The University of Auckland |
Zhang, Hanyu | University of Alabama at Birmingham |
Paskaranandavadivel, Niranchan | The University OfAuckland |
Patton, Haley | University of Alabama at Birmingham |
Garrett, Amy | The University of Auckland |
Angeli-Gordon, Timothy Robert | Auckland Bioengineering Institute, University of Auckland |
Nisbet, Linley | University of Auckland |
Rogers, Jack | University of Alabama in Birmingham |
Cheng, Leo K | The University of Auckland |
Keywords: Multimodal imaging, Electrical source imaging, Optical imaging
Abstract: Abstract— Gastric pacing is an attractive therapeutic approach for correcting abnormal bioelectrical activity. While high resolution (HR) electrical mapping techniques have largely contributed to the current understanding of the effect of pacing on the electrophysiological function, these mapping techniques are restricted to surface contact electrodes and the signal quality can be corrupted by pacing artifacts. Optical mapping of voltage sensitive dyes is an alternative approach used in cardiac research, and the signal quality is not affected by pacing artifacts. In this study, we simultaneously applied HR optical and electrical mapping techniques to evaluate the bioelectrical slow wave response to gastric pacing. The studies were conducted in vivo on porcine stomachs (n=3) where the gastric electrical activity was entrained using high energy pacing. The pacing response was optically tracked using voltage sensitive fluorescent dyes and electrically tracked using surface contact electrodes positioned on adjacent regions. Slow waves were captured optically and electrically and were concordant in time and direction of propagation with comparable mean velocities (6.8 ± 0.8 vs 6.4 ± 1 mm/s) and periods (30.2 ± 4.4 vs 28.7 ± 0.8 s). Importantly, the optical signals were free from pacing artifacts otherwise induced in electrical recordings highlighting an advantage of optical mapping.
|
|
15:45-17:30, Paper WeEP-09.9 | |
Enhanced Vascular Features in Porcine Gastrointestinal Endoscopy Using Multispectral Imaging |
|
Raj, Antony | HTIC IIT Madras |
Sebastin, Amalan | Healthcare Technology Innovation Centre |
Subbu, Navin | Healthcare Technology Innovation Centre |
Sreeletha Premkumar, Preejith | Healthcare Technology Innovation Center (HTIC), IIT Madras |
Sivaprakasam, Mohanasankar | Indian Institute of Technology Madras |
Keywords: Image visualization, Image enhancement
Abstract: Endoscopic investigation is a predominant standard while assessing the gastrointestinal tract. Even though it has been rigorously used in diagnostics for many decades, a high miss rate has been recorded. Advanced endoscopic imaging still has not found solutions to problems like early cancer detection, polyp generality, disease classification, etc. One of the less explored techniques to study early cancer detection is spectral imaging which deals with the absorption and reflection spectra of various wavelengths of light by different layers of tissue. To study tissues under various illumination, a multi-spectral light source unit that can be used along with an endoscopy system was developed with 10 different LEDs of very narrow bandwidths. Using this light source, a feasibility study was performed on an animal in which the upper GI tract of a porcine model was imaged and sample images were taken for processing from five different sections. Some wavelengths showed better contrast enhancements for visualization of vascular structures. Wavelength 420 nm (violet light) showed better contrast and the gradient of the line profile histogram showed the highest intensity change between the blood vessels and the surrounding mucosa. These enhancements showed that spectral imaging can potentially help in studying tissues for early cancer detection and improved visualization of the GI tract using endoscopy.
|
|
WeEP-10 |
Hall 5 |
Theme 04. Systems Modeling: Fundamentals and Applications |
Poster Session |
Chair: Athavale, Omkar Nitin | The University of Auckland |
|
15:45-17:30, Paper WeEP-10.1 | |
A Novel Ergodic Cellular Automaton Gene Network Model towards Efficient Hardware-Based Genome Simulator |
|
Shirafuji, Shogo | Hosei University |
Torikai, Hiroyuki | Hosei University |
|
|
15:45-17:30, Paper WeEP-10.2 | |
Concept Development of an On-Chip PET System |
|
Clement, Christoph | Inselspital Bern |
Birindelli, Gabriele | Inselspital Bern |
Pizzichemi, Marco | CERN |
Pagano, Fiammetta | CERN |
Kruithof-de Julio, Marianna | Inselspital Bern |
Rominger, Axel | Inselspital Bern |
Ziegler, Sibylle | Department of Nuclear Medicine, University Hospital Munich, LMU |
Auffray, Etiennette | CERN |
Shi, Kuangyu | University of Bern |
Keywords: Models of medical devices, Translational biomedical informatics - Data processing
Abstract: Organs-on-Chips (OOCs), microdevices mimicking in vivo organs, find growing applications in disease modeling and drug discovery. With the increasing number of uses comes a strong demand for imaging capabilities of OOCs. Positron Emission Tomography (PET) would be ideal for OOC imaging, however, current PET systems have insufficient spatial resolution for this task. In this work, we propose the concept of an On-Chip PET system capable of imaging OOCs. Our system consists of four detectors arranged around the OOC device. Each detector is made of two monolithic Lutetium–yttrium oxyorthosilicate (LYSO) crystals and covered with Silicon photomultipliers (SiPMs) on multiple surfaces. We use a Convolutional Neural Network (CNN) trained with data from a Monte Carlo Simulation (MCS) to predict the first gamma-ray interaction position inside the detector from the light patterns that are recorded by the SiPMs on the detector’s surfaces. With the Line of Responses (LORs) created by the predicted interaction positions, we reconstruct with Simultaneous Algebraic Reconstruction Technique (SART). The CNN achieves a mean average prediction error of 0.78 mm in the best configuration. We use the trained network to reconstruct an image of a grid of 21 point sources spread across the field-of-view and obtain a mean spatial resolution of 0.53 mm. We demonstrate that it is possible to achieve a spatial resolution of almost 0.5 mm in a PET system made of multiple monolithic LYSO crystals by directly predicting the scintillation position from light patterns created with SiPMs. We observe that CNNs from the ResNet family perform better than those from the EfficientNet family and that certain surfaces encode significantly more information for the scintillation-point prediction than others.
|
|
15:45-17:30, Paper WeEP-10.3 | |
Correlation in Dose-Response to Rapid and Long-Acting Insulin for People with Type 1 Diabetes |
|
Engell, Sarah Ellinor | Technical University of Denmark |
Aradóttir, Tinna Björk | Technical University of Denmark |
Bengtsson, Henrik | Novo Nordisk A/S |
Jorgensen, John Bagterp | Technical University of Denmark |
Keywords: Systems modeling - Clinical applications of biological networks, Systems modeling - Decision making, Modeling of cell, tissue, and regenerative medicine - PK/PD
Abstract: In diabetes, it can become necessary to switch between pump- and pen-based insulin treatment. This switch involves a translation between rapid- and long-acting insulin analogues. In standard-of-care translation algorithms, a unit-to-unit conversion is applied. However, this simplification may not fit all individuals. In this paper, we investigate the correlation between dose-response to rapid- and long-acting insulin in the same individual, and compare the correlation across individuals. As a measure of dose-response, we estimate the insulin sensitivity in clinical data from 25 subjects with type 1 diabetes. For parameter estimation, we use maximum likelihood with a continuous-discrete extended Kalman filter and Bergman's minimal model. The results show a weak correlation between insulin sensitivity to rapid- and long-acting insulin across individuals. On this sparse data set, the analysis suggests that the standardized unit-to-unit translation between insulin analogues may not benefit all subjects.
|
|
15:45-17:30, Paper WeEP-10.4 | |
Modeling the Viral Kinetics of Influenza a During Infection in Humans |
|
Hofflich, Benjamin | University of California San Diego |
Lunardhi, Alan | University of California San Diego |
Sunku, Nitesh | UC San Diego |
Tsujimoto, Jason | University of California, San Diego |
Cauwenberghs, Gert | University of California San Diego |
Keywords: Systems biology and systems medicine - Modeling of biomolecular system dynamics, Synthetic Biology - Control systems and circuits, Systems biology and systems medicine - prediction of disease related regulator
Abstract: This study explores the natural control system in the body for responding to exposure to the Influenza A virus. More specifically, it delves into the development of a model to simulate the responses of target uninfected cell counts, infected cell counts, and viral titers. There are two particular models of interest: a delayed model that incorporates the brief inactive period for newly infected cells, and a non-delayed model reflecting only infected cells without delay after initial infection. Both models are commonly used in the literature and the benefits of each model are studied and explained. We generate Simulink models for both the delayed and non-delayed sets of ordinary differential equations (ODEs) to simulate responses to different viral titer impulses. Additionally, this study aims to extrapolate these models to the case for a vaccinated individual. To do this, we modify the viral clearance rate and infected cell death rate of our initial model to account for the improved immune response generated by vaccines.
|
|
15:45-17:30, Paper WeEP-10.5 | |
Frequency Response in Splicing Regulation under mRNA Auto-Depletion Control |
|
Giaretta, Alberto | University of Cambridge |
Keywords: Systems biology and systems medicine - Modeling of gene/epigene regulatory networks, Systems biology and systems medicine - Modeling of biomolecular system dynamics, Systems biology and systems medicine - Modeling of biomolecular system pathways
Abstract: Nowadays, there exists a huge literature about stochastic model of transcriptional and translational control in gene networks. However, results related to post-transcriptional regulation via splicing and its connection with transcriptional and translational regulation are almost missing in the current literature and only related to the steady state moments investigation. Nowadays, it is becoming of paramount importance the need for modeling post-transcriptional regulation via splicing especially for DNA viruses or retroviruses. However, there exists only few studies in the literature about splicing regulation and none of them investigate its behavior in the frequency domain that can unveil important features of dynamical stochastic systems that cannot be revealed by the sole steady state moment investigation. The aim of this work is to theoretically investigate a simple gene network subject to splicing regulation with negative feedback control, implemented through mRNA auto-depletion under a frequency domain perspective. This study showed the pivotal role of the burst size, enhancing the noise power spectrum, as well as the splicing conversion rates capable to increase and decrease the noise power spectrum in the pre-mRNA and mRNA, respectively, for high values of conversion rates. Importantly, it shows the capability of the mRNA autodepletion control to modulate the noise as a frequency-dependent amplifying control as a function of the negative feedback strengths.
|
|
15:45-17:30, Paper WeEP-10.6 | |
Stochastic Resonance Governs Memory Consolidation Accuracy in a Neural Network Model |
|
Caston, Rose | University of Utah |
Wilson, Matthew | University of Utah |
Comeaux, Phillip | University of Utah |
Dorval, Alan | University of Utah |
Keywords: Computational modeling - Biological networks, Model building - Signal and pattern recognition
Abstract: The formation and recollection of memories is a multi-step neural process subject to errors. We propose a computational model of memory nodes receiving input from a colored tic-tac-toe board. We report memory errors during consolidation and reconsolidation when different noise levels are introduced into the model. The model is based on Hebbian plasticity and attempts to store the color and position of an X or O from the board. Memory nodes simulating neurons use an integrate-and-fire model to represent the correct or incorrect storage of the board information by scaling synaptic weights. We explored how baseline firing rate, which we considered analogous to noise in storing memory, impacted the creation of correct and incorrect memories. We found that a higher firing rate was associated with fewer accurate memories. Interestingly, the ideal amount of noise for correct memory storage was nonzero. This phenomenon is known as stochastic resonance, wherein random noise enhances processing. We also examined how many times our model could reactivate a memory before making an error. We found an exponentially decaying response, with a low firing rate yielding more stable memories. Even though our model incorporates only two memory nodes, it provides a basis for examining the consolidation and retrieval of memory storage based on the unique visual input of a tic-tac-toe board. Further work may incorporate different inputs, more nodes, and increased network complexity.
|
|
15:45-17:30, Paper WeEP-10.7 | |
Using Operative Reports to Predict Heart Transplantation Survival |
|
Klang, Marcus | Dept. of Computer Science, Lund University |
Díaz, Daniel | Lund University |
Medved, Dennis | Lund University |
Nugues, Pierre | Lund University |
Nilsson, Johan | Dept. Clinical Sciences Lund, CardioThoracic Surgery, Lund Unive |
Keywords: Systems modeling - Decision making, Translational biomedical informatics - Decision making, Translational biomedical informatics - Mining clinical data
Abstract: Heart transplantation is a difficult procedure compared with other surgical operations, with a greater outcome uncertainty such as late rejection and death. We can model the success of heart transplants from predicting factors such as the age, sex, diagnosis, etc., of the donor and recipient. Although predictions can mitigate the uncertainty on the transplantation outcome, their accuracy is far from perfect. In this paper, we describe a new method to predict the outcome of a transplantation from textual operative reports instead of traditional tabular data. We carried out an experiment on 300 surgical reports to determine the survival rates at one year and five years. Using a truncated TF-IDF vectorization of the texts and logistic regression, we could reach a macro F1 of 59.1%, respectively, 54.9% with a five-fold cross validation. While the size of the corpus is relatively small, our experiments show that the operative textual sources can discriminate the transplantation outcomes and could be a valuable additional input to existing prediction systems.
|
|
15:45-17:30, Paper WeEP-10.8 | |
Low Cardiac Frequency Associated with Higher Number of Extrasistoles in a Computational Model of Brugada Syndrome |
|
Seghetti, Paolo | Scuola Superiore Sant'Anna |
Biasi, Niccolò | University of Pisa |
Laurino, Marco | National Research Council |
tognetti, alessandro | University of Pisa |
|
|
15:45-17:30, Paper WeEP-10.9 | |
Mathematical Modeling of Gastric Slow Waves During Electrical Field Stimulation |
|
Athavale, Omkar Nitin | The University of Auckland |
Cheng, Leo K | The University of Auckland |
Clark, Alys | The University of Auckland |
Avci, Recep | The University of Auckland |
Du, Peng | The University of Auckland |
Keywords: Modeling of cell, tissue, and regenerative medicine - Ionic modeling, Data-driven modeling, Model building - Sensitivity analysis
Abstract: While neural modulation has been trialed as a therapy for functional gastric motility disorders, a computational model that guides stimulation protocol does not exist. In this work, a mathematical model of gastric slow wave activity, which incorporates the effects of neurotransmitter release during electrical field stimulation (EFS), was developed. Slow wave frequency responses due to the release of acetylcholine and slow wave amplitude responses due to the release of nitric oxide were modeled. The model was calibrated using experimental data from literature. A sensitivity analysis was conducted, which showed that the model yielded stable, periodic solutions for EFS frequencies in the range 0 – 20 Hz. A 25% increase in the input parameter (EFS frequency) from 5 Hz to 6.25 Hz resulted in a 5.2% increase in slow wave frequency and a 3.2% decrease in slow wave amplitude. Simulated EFS showed that, for stimulation at 15 Hz, with blocking of the nitrergic neurotransmitter pathway the slow wave increased from the no stimulation scenario in frequency by only 2.4 compared to 2.7 when the nitrergic pathway was not blocked. A 21% reduction in slow wave amplitude occurred when the cholinergic pathway was blocked, compared to a 46% reduction when no neurotransmitter pathways were blocked.
|
|
15:45-17:30, Paper WeEP-10.10 | |
Luteinizing Hormone Dynamics in Menstruation |
|
Lee, Irene | University of California, San Diego |
Prabhu, Swathi | University of California, San Diego |
Singhal, Meenakshi | UC San Diego |
Tor, Alice | UC San Diego |
Cauwenberghs, Gert | University of California San Diego |
Keywords: Systems modeling - Clinical applications of biological networks, Model building - Sensitivity analysis
Abstract: Menstruation, the cyclic shedding of the uterine layer, is a complex control system. Relevant hormones dictate numerous negative and positive feedback mechanisms to determine the latency, duration, and intensity of each menstrual stage. Consequently, engineers can model the larger control system of menstruation into subsections for each key hormone in order to elucidate the cascading effects and causes of each menstrual event. The hypothalamus, pituitary gland, and ovaries orchestrate the process by producing and delivering the key hormones via the bloodstream. Gonadotropin-releasing hormone (GnRH) from the hypothalamus stimulates the pituitary gland to release luteinizing hormone (LH) and follicle-stimulating hormone (FSH). LH, in turn, cascades to stimulate the release of another important hormone, progesterone. Numerous other factors affect menstruation, but the interactions of these four key hormones - GnRH, LH, FSH, and progesterone - play the most major role in menstrual regulation. The menstrual cycle has two main phases: follicular and luteal. The follicular phase lasts about 14 days, during which the levels of GnRH increase, the levels of FSH initially increase and then decrease, and the levels of LH remain low and steady. During the luteal phase, the levels of GnRH decrease in response to ovulation, causing levels of FSH and LH to subsequently decrease as well. The levels of LH are determined by the concentration of estrogen in the blood and the phase of the menstrual cycle. During the follicular phase, low levels of estrogen have a negative feedback relationship with LH and explain the decreasing levels of FSH and low levels of LH. On the other hand, during the luteal phase, high levels of estrogen share a positive feedback relationship with LH and consequently results in an exponential increase in LH.
|
|
WeEP-11 |
Hall 5 |
Theme 05. Cardiovascular Disease |
Poster Session |
Chair: Yu, Yih-Choung | Lafayette College |
|
15:45-17:30, Paper WeEP-11.1 | |
Thrombus Detection in the Maglev Blood Pump to Distinguish from Viscosity Change by the Circular Orbital Oscillation of the Impeller (withdrawn from program) |
|
Hatakenaka, Kohei | Tokyo Institute of Technology |
Hijikata, Wataru | Tokyo Institute of Technology |
Fujiwara, Tatsuki | Tokyo Medical and Dental University, Japan |
Ohuchi, Katsuhiro | Tokyo Medical and Dental University |
Inoue, Yusuke | Asahikawa Medical University |
|
15:45-17:30, Paper WeEP-11.2 | |
Estimation of Characteristic Impedance Using Multi-Gaussian Modelled Flow Velocity Waveform: A Virtual Subjects Study |
|
Manoj, Rahul | Indian Institute of Technology Madras |
V, Raj Kiran | IIT Madras |
P M, Nabeel | Indian Institute of Technology Madras |
Sivaprakasam, Mohanasankar | Indian Institute of Technology Madras |
Joseph, Jayaraj | HTIC, Indian Institute of Technology Madras |
Keywords: Cardiovascular and respiratory system modeling - Blood flow models, Cardiovascular and respiratory signal processing - Cardiovascular signal processing
Abstract: Characteristic impedance (ZC) of the blood vessel relates the pulsatile pressure to pulsatile blood flow velocity devoid of any wave reflections. Estimation of ZC is crucial for evaluating local pulse wave velocity and in solving wave separation analysis (WSA) which separates the forward-backward pressure and flow velocity waveforms. As opposed to conventional WSA, which requires simultaneous measurement of pressure and flow velocity waveform, simplified WSA relies on modelled flow velocity waveforms. This work uses a multi-Gaussian decomposition (MGD) modelled flow velocity waveform to estimate ZC by employing a frequency domain analysis. Thus obtained ZC is compared with ZC estimated from true flow velocity waveform for healthy (virtual) subjects for the carotid artery. The MGD modelled flow velocity waveform was able to estimate ZC for a range of 4.98 to 34.79 (in arbitrary units) with a group average of 16.43±0.10. The difference between the group average values of both Zc was only 4.72%. A statistically significant and strong correlation (r = 0.708, p < 0.0001) was observed between ZC obtained from MGD modelled flow velocity waveform and ZC obtained from actual flow velocity waveform. The bias for ZC between the two methods was 0.74 with confidence intervals (CIs) between 7.44 and -5.96 for the Bland-Altman analysis. Therefore, ZC from MGD modelled flow velocity waveform is a potential surrogate of the flow velocity model for WSA at the carotid artery.
|
|
15:45-17:30, Paper WeEP-11.3 | |
Cardiovascular Dynamics in COVID-19: A Heart Rate Variability Investigation |
|
Aliani, Cosimo | University of Florence |
Bocchi, Leonardo | Università Degli Studi Di Firenze, Firenze, Italy |
Lanata', Antonio | University of Florence |
Rossi, Eva | University of Florence |
Spina, Rosario | Ospedale San Giuseppe Empoli |
Calamai, Italo | Ospedale San Giuseppe Empoli |
Luchini, Marco | Ospedale San Giuseppe Empoli |
Deodati, Rossella | Ospedale S. Giuseppe |
Keywords: Cardiovascular and respiratory signal processing - Cardiovascular signal processing, Cardiovascular and respiratory signal processing - Complexity in cardiovascular or respiratory signals, Cardiovascular regulation - Heart rate variability
Abstract: COVID-19 is known to be a cause of microvascular disease due, for example, to the cytokine storm inflammatory response and the result of blood coagulation. This study reports an investigation on Heart Rate Variability (HRV) extracted from photoplethysmography (PPG) signals measured from healthy subjects and COVID-19 affected patients. We aimed to determine a statistical difference between HRV parameters among subjects’ groups. Specifically, statistical analysis through Mann-Whitney U Test (MWUT) was applied to compare 42 different parameters extracted from PPG signals of 143 subjects: 50 healthy subjects (i.e. group 0) and 93 affected from COVID-19 patients stratified through increasing COVID severity index (i.e. groups 1 and 2). Results showed significant statistical differences between groups in several HRV parameters. In particular, Multiscale Entropy (MSE) analysis provided the master key in patient stratification assessment. In fact, MSE11, MSE12, MSE15, MSE16, MSE17, MSE18, MSE19 and MSE20 keep statistical significant difference during all the comparisons between healthy subjects and patients from all the pathological groups. Our preliminary results suggest that it could be possible to distinguish between healthy and COVID-19 affected subjects based on cardiovascular dynamics. This study opens to future evaluations in using machine learning models for automatic decision-makers to distinguish between healthy and COVID-19 subjects, as well as within COVID-19 severity levels. Clinical Relevance: This establishes the possibility to distinguish healthy subjects from COVID-19 affected patients basing on HRV parameters monitored non invasively by PPG.
|
|
15:45-17:30, Paper WeEP-11.4 | |
Numerical and Experimental Analysis for a Magnetic Levitation System in a Hemocompatibility Assessment Platform |
|
Tedesco V, Victor | Texas Heart Institute |
Kiang, Simon | Rice University |
Karnik, Shweta | Texas Heart Institute |
Smith, P. Alex | Texas Heart Institute at St. Luke's Hospital |
Nissim, Lee | University of Bath |
Fraser, Katharine H. | University of Bath |
Kurita, Nobuyuki | Gunma University |
Frazier, O.H. | Texas Heart Institute @ St. Luke's Hospital |
Wang, Yaxin | Texas Heart Institute |
Keywords: Cardiac mechanics, structure & function - Ventricular assist devices
Abstract: Development of pediatric left ventricular assist devices (LVADs) has lagged behind that of adult LVADs, primarily due to the size and hemocompatibility constraints of pediatric anatomy. To quantify sources of blood trauma during LVAD development, we proposed a hemocompatibility assessment platform (HAP) that can evaluate the hemocompatibility of individual components of LVADs. To eliminate the hemolysis induced by the HAP itself, we incorporated passive magnetic (PM) bearings to suspend the rotor radially and an active magnetic bearing (AMB) to control the axial position. In this study, we numerically evaluated AMB forces of 2 geometries and validated the model by comparing its predictions with experimental results. The magnetic forces generated by the AMB were evaluated by increasing the rotor-stator gap from 0.1 mm to 0.5 mm with a 0.1 mm increment and by varying the coil current from -2 A to 2 A with a 1 A increment. The average error of the numerical models was 8.8% and 7.0% for the two geometries, respectively. Higher errors were found at smaller (<0.2mm) rotor-stator gaps. For both biasing ring sizes, the AMB exhibits high magnetic stiffness from -1 A to 1 A, though it saturates for currents of -2 A and 2 A. This region of high current stiffness was identified as the optimal control region. In future work, this function will be used to tune a control algorithm to modulate current supplied to the AMB, ultimately stabilizing the rotor axially.
|
|
15:45-17:30, Paper WeEP-11.5 | |
Detecting Sepsis from Photoplethysmography: Strategies for Dataset Preparation |
|
Lombardi, Sara | University of Florence |
Partanen, Petri | University of Oulu |
Bocchi, Leonardo | Università Degli Studi Di Firenze, Firenze, Italy |
Keywords: Cardiovascular and respiratory signal processing - Cardiovascular signal processing
Abstract: Sepsis is one of the most frequent causes of death in Intensive Care Units, and its prognosis greatly depend on timeliness of diagnosis. MIMIC-III database is a frequent source of data for developing method for automatic sepsis detection. However, the heterogeneity of data jeopardize the feasibility of the task. In this work we propose a selection strategy for generating high quality data suitable for training a sepsis detection system based on the utilization of only plethysmographic data. Clinical relevance: A system for detecting sepsis based only on PPG may be potentially at virtually no cost in any case clinicians suspect the possibility of developing sepsis.
|
|
15:45-17:30, Paper WeEP-11.6 | |
Minimally Invasive Monitoring of Cardiac Function for Patients with Rotary VAD Support, a Frequency Domain Approach |
|
Yu, Yih-Choung | Lafayette College |
Rakotozandry, Tafita | Lafayette College |
Adem, Robson | University of Central Florida, Boston Dynamics |
Kosednar, Sophia | Lafayette College |
Keywords: Cardiac mechanics, structure & function - Ventricular assist devices, Cardiovascular and respiratory signal processing - Cardiovascular signal processing, Vascular mechanics and hemodynamics - Arterial pressure in cardiovascular disease
Abstract: A new cardiac function estimation algorithm has been developed to monitor a patient’s myocardial contractility while supported by a rotary ventricular device (VAD). This algorithm uses the raw air pressure signal from the cuff pressure sensor, filters the signal with a bandpass filter, and then processes the signal through the Fast Fourier Transform to detect the first and second highest magnitude components. A systematic study by using a computer model to simulate the interaction between the cardiovascular system and the rotary VAD under different contractual states of the heart (failure, recovery, and healthy) demonstrated that these two magnitude components increased when the healthy status of the heart improved. Determination of these two magnitude components does not need any indwelling sensor but an air pressure cuff sensor. Performing this test does not require any interruption of regular rotary VAD operation or cardiac care facility. Successful development of this algorithm would allow more frequent monitoring for patients with less concerns of safety or examination cost, which could potentially improve the outcome of weaning patients from VAD support.
|
|
15:45-17:30, Paper WeEP-11.7 | |
Dynamic Evaluation of an Active Axial Magnetic Levitated Bearing System in a Hemocompatibility Assessment Platform |
|
Kurita, Nobuyuki | Gunma University |
Ogiwara, Eiji | Gunma University |
Luo, Neil | Texas Heart Institute |
Kiang, Simon | Rice University |
Karnik, Shweta | Texas Heart Institute |
Smith, Peter Alex | TYBR Health, Inc |
Nissim, Lee | University of Bath |
Fraser, Katharine H. | University of Bath |
Frazier, O.H. | Texas Heart Institute @ St. Luke's Hospital |
Wang, Yaxin | Texas Heart Institute |
Keywords: Cardiac mechanics, structure & function - Ventricular assist devices
Abstract: To evaluate the hemocompatibility of individual components of our pediatric left ventricular assist device (LVAD), we proposed a hemocompatibility assessment platform (HAP) with a magnetic levitated bearing system. The HAP consists of a drive system utilizing a brushless direct current (BLDC) motor, passive magnetic bearings (PMB), and an active magnetically levitated bearing (AMB) to reduce the hemolysis generated by HAP itself. In this study, we designed and evaluated the performance of the AMB by measuring radial and axial displacements of the rotor resulting from radially destabilizing forces as well as the performance of the drive system when rotated at increasing speeds to 1,200 rotations per minute (rpm). The results show that, with radial disturbance, the AMB is capable of maintaining axial stability for the BLDC motor system. The AMB can control up to 1,200 rpm without any contact between the rotor and stator. Future work includes geometry optimization for the AMB structure and increase the capability to control stable high-speed rotation for the entire system.
|
|
15:45-17:30, Paper WeEP-11.8 | |
An Impedance Sensor for Pathologically Relevant Detection of In-Stent Restenosis in Vitro |
|
Hoare, Daniel | University of Glasgow |
Fisher, Simon | University of Glasgow |
Nelson, Finlay | University of Glasgow |
Tsiamis, Andreas | University of Edinburgh |
Marland, Jamie | University of Edinburgh |
Mitra, Srinjoy | University of Edinburgh |
Neale, Steven | University of Glasgow |
Mercer, John | University of Glasgow |
Keywords: Smart pacemaker and implanted defibrillator, Cardiovascular, respiratory, and sleep devices - Sensors, Cardiovascular, respiratory, and sleep devices - Implantables
Abstract: Cardiovascular disease (CVD) is the biggest cause of death globally. CVD is caused by atherosclerosis which is the accumulation of fatty deposits, often within the fine arteries of the heart or brain. These blockages reduce blood flow and lead to oxygen starvation (ischemia) which can lead to heart attacks and strokes. To treat blocked arteries an implantable device called a stent re-opens the artery to reinstate blood flow to the organ. The stent itself can become blocked over time by cell growth (intimal hyperplasia) which is characterised by excessive smooth muscle cell proliferation. Sensors based on electrical impedance spectroscopy (EIS) embedded in a stent could detect this reblocking to allow for early intervention. Using platinum interdigitated electrodes on silicon sensor wafers we were able to co-culture different ratios of mouse smooth muscle cells and mouse endothelial cells on these sensors. This mimics the complex, multicellular environment which a stent is found in vivo when undergoing neo-intimal hyperplasia. Trends in the cell impedances were then characterised using the detection frequency and the gradient of change between populations over time which we termed 'Peak Cumulative Gradients (PCG). PCGs were calculated to successfully discriminate each cell type. This work moves towards a sensor that may help guide clinician's decision-making in a disease that is historically silent and difficult to detect.
|
|
WeEP-12 |
Hall 5 |
Theme 06. Machine Learning, Brain Signal Processing for Neurorehabilitation
& Neural Engineering II |
Poster Session |
Chair: Toth, Jake | University of Sheffield |
|
15:45-17:30, Paper WeEP-12.1 | |
Inter-Patient Seizure Detection by Brain-Connectivity Analysis Using Dynamic Graph Isomorphism Network |
|
Tianli, Tao | ShanghaiTech University |
Guo, Lianghu | Shanghaitech |
He, Qiang | United Imaging Healthcare |
Zhang, Han | ShanghaiTech University |
Xu, Lin | ShanghaiTech University |
Keywords: Neurological disorders - Epilepsy, Neurological disorders - Diagnostic and evaluation techniques, Brain functional imaging - EEG
Abstract: Epilepsy is a neurological disease caused by abnormal neural electrical discharges. Electroencephalography (EEG) is a powerful tool to measure the brain electrical activity and has been widely used for seizure detection. Manual EEG analysis is labor-intensive and time-consuming. Automatic seizure detection is urgently demanded for long-time seizure monitoring. Many methods have been proposed for automatic seizure detection based on EEG signals. However, most of the existing methods are patient-specific with limited generalizability. Few studies investigate inter-patient seizure detection, which remains challenging. The aim of the present study is therefore to develop advanced algorithms for efficient inter-patient seizure detection using EEG. To this end, dynamic brain network is employed to capture the spatiotemporal dynamics of the connectivity among brain regions. A novel graph neural network referred to as graph isomorphic network is proposed for effective local-global spatiotemporal feature extraction and seizure classification. The proposed method is evaluated with the CHB-MIT open dataset with a ten-fold cross-validation. The results reveal excellent performance for the proposed method, with accuracy, sensitivity, and specificity of 96.2%, 95.4%, and 97.0% respectively, significantly higher than the results reported in the literature. Our results provide useful information for inter-patient seizure detection, particularly for long-time ambulatory seizure monitoring.
|
|
15:45-17:30, Paper WeEP-12.2 | |
Complexity Modulation in Functional Brain-Heart Interplay Series Driven by Emotional Stimuli: An Early Study Using Fuzzy Entropy |
|
Catrambone, Vincenzo | Università Di Pisa |
Patron, Elisabetta | Università Di Padova |
Gentili, Claudio | University of Pisa |
Valenza, Gaetano | University of Pisa |
Keywords: Neural signal processing, Brain physiology and modeling - Nonlinear coupling, Brain functional imaging - EEG
Abstract: Increasing attention has recently been devoted to the multidisciplinary investigation of functional brain-heart interplay (BHI), which has provided meaningful insights in neuroscience and clinical domains including cardiology, neurology, clinical psychology, and psychiatry. While neural (brain) and heartbeat series show high nonlinear and complex dynamics, a complexity analysis on BHI series has not been performed yet. To this end, in this preliminary study, we investigate BHI complexity modulation in 17 healthy subjects undergoing a 3-minute resting state and emotional elicitation through standardized image slideshow. Electroencephalographic and heart rate variability series were the inputs of an ad-hoc BHI model, which provides directional (from-heart-to-brain and from-brain-to-heart) estimates at different frequency bands. A Fuzzy entropy analysis was performed channel-wise on the model output for the two experimental conditions. Results suggest that BHI complexity increases in the emotional elicitation phase with respect to a resting state, especially in the functional direction from the heart to the brain. We conclude that BHI complexity may be a viable computational tool to characterize neurophysiological and pathological states under different experimental conditions.
|
|
15:45-17:30, Paper WeEP-12.3 | |
Levodopa-Dependent Differences in the Non-Oscillatory Activity of the Subthalamic Nucleus |
|
Ferrara, Rosanna | Università Degli Studi Di Milano |
Bianchi, Anna Maria | Politecnico Di Milano |
Priori, Alberto | Università Di Milano, Fondazione IRCCS Ospedale MaggiorePoliclin |
Coelli, Stefania | Department of Electronics, Information and Bioengineering, Polit |
Averna, Alberto | Università Degli Studi Di Milano |
Keywords: Neural signals - Nonlinear analysis, Neural signal processing, Neurological disorders
Abstract: The study of local field potentials (LFP) recorded from the basal ganglia of patients with movement disorders led to significant advancement in the understanding the pathophysiology of Parkinson’s disease (PD). The possibility of investigating possible changes in the activity of the brain caused by the levodopa administration may provide a useful tool to evaluate the influence or the side-effects of the treatment from patient to patient. The analysis was carried out through a systematic analysis of the fractal component of the subthalamic local field potentials (STN-LFP) that may reveal, with respect to the classical power spectrum analysis, novel important information about the dynamic modulation caused by the drug intake. Indeed, so far, much of what is known about that is related to the presence of a spectral peak in the beta frequency band then attenuated after the levodopa administration. The nonlinear power-law exponent goes beyond this feature, exploring differences that reflect the fractal (scale-free) behavior of the PD brain dynamics. Here, in order to demonstrate that the presence or absence of the peak has no effect on the computation of the power-law exponent, we used simulated LFP recordings. After that, we performed the fractal analysis in shorts epochs of STN LFPs recordings (N=24 patients, 12 females and 12 males) before and after Levodopa administration. We found no differences in the nonlinear power-law exponent for simulated data, reinforcing the idea that the parameter was not influenced by the attenuation of the hallmark peak for PD patients. As regard real LFP time series, we found that pharmacological treatment for PD differently altered LFP power of non-oscillatory activity, as well as changed the level of fractal exponent in specific frequency bands. Particularly we observed an increase of the fractal exponent in condition of post-levodopa with significant differences related to the response to levodopa in Parkinson’s disease.
|
|
15:45-17:30, Paper WeEP-12.4 | |
Using Neurofeedback from Steady-State Visual Evoked Potentials to Target Affect-Biased Attention in Augmented Reality |
|
Huang, Xiaofei | Northeastern University |
Mak, Jennifer | University of Pittsburgh |
Wears, Anna | University of Pittsburgh |
Price, Rebecca B. | University of Pittsburgh |
Akcakaya, Murat | University of Pittsburgh |
Ostadabbas, Sarah | Northeastern University |
Woody, Mary L. | University of Pittsburgh |
Keywords: Brain-computer/machine interface, Brain functional imaging - EEG, Brain functional imaging - Evoked potentials
Abstract: Biases in attention to emotional stimuli (i.e., affect-biased attention) contribute to the development and maintenance of depression and anxiety and may be a promising target for intervention. Past attempts to therapeutically modify affect-biased attention have been unsatisfactory due to issues with reliability and precision. Electroencephalogram (EEG)-derived steady-state visual evoked potentials (SSVEPS) provide a temporally-sensitive biological index of attention to competing visual stimuli at the level of neuronal populations in the visual cortex. SSVEPS can potentially be used to quantify whether affective distractors vs. task-relevant stimuli have ``won'' the competition for attention at a trial-by-trial level during neurofeedback sessions. This study piloted a protocol for a SSVEP-based neurofeedback training to modify affect-biased attention using a portable augmented-reality (AR) EEG interface. During neurofeedback sessions with five healthy participants, significantly greater attention was given to the task-relevant stimulus (a Gabor patch) than to affective distractors (negative emotional expressions) across SSVEP indices (p<0.0001). SSVEP indices exhibited excellent internal consistency as evidenced by a maximum Guttman split-half coefficient of 0.97 when comparing even to odd trials. Further testing is required, but findings suggest several SSVEP neurofeedback calculation methods most deserving of additional investigation and support ongoing efforts to develop and implement a SSVEP-guided AR-based neurofeedback training to modify affect-biased attention in adolescent girls at high risk for depression.
|
|
15:45-17:30, Paper WeEP-12.5 | |
Complexity-Based Encoded Information Quantification in Neurophysiological Recordings |
|
Fuhrer, Julian | University of Oslo |
Blenkmann, Alejandro Omar | University of Oslo |
Tor Endestad, Tor | University of Oslo |
Solbakk, Anne-Kristin | University of Oslo |
Glette, Kyrre | University of Oslo |
Keywords: Brain functional imaging - EEG, Neural signals - Information theory, Brain functional imaging - Evoked potentials
Abstract: Brain activity differs vastly between sleep, cognitive tasks, and action. Information theory is an appropriate concept to analytically quantify these brain states. Based on neurophysiological recordings, this concept can handle complex data sets, is free of any requirements about the data structure, and can infer the present underlying brain mechanisms. Specifically, by utilizing algorithmic information theory, it is possible to estimate the absolute information contained in brain responses. While current approaches that apply this theory to neurophysiological recordings can discriminate between different brain states, they are limited in directly quantifying the degree of similarity or encoded information between brain responses. Here, we propose a method grounded in algorithmic information theory that affords direct statements about responses' similarity by estimating the encoded information through a compression-based scheme. We validated this method by applying it to both synthetic and real neurophysiological data and compared its efficiency to the mutual information measure. This proposed procedure is especially suited for task paradigms contrasting different event types because it can precisely quantify the similarity of neuronal responses.
|
|
15:45-17:30, Paper WeEP-12.6 | |
Cortico-Muscular Coupling Allows to Discriminate Different Types of Hand Movements |
|
de Seta, Valeria | Sapienza University of Rome |
Colamarino, Emma | Sapienza University of Rome |
Cincotti, Febo | Sapienza University of Rome |
Mattia, Donatella | Fondazione Santa Lucia IRCCS |
Mongiardini, Elena | Sapienza University of Rome |
Pichiorri, Floriana | Fondazione Santa Lucia, IRCCS, Rome, Italy |
Toppi, Jlenia | University of Rome "Sapienza" |
Keywords: Brain-computer/machine interface, Brain functional imaging - Connectivity and information flow, Neural signals - Machine learning & Classification
Abstract: Cortico-muscular coupling (CMC) could be used as potential input of a novel hybrid Brain-Computer Interface (hBCI) for motor re-learning after stroke. Here, we aim of addressing the design of a hBCI able to classify different movement tasks taking into account the interplay between the cerebral and residual or recovered muscular activity involved in a given movement. Hence, we compared the performances of four classification methods based on CMC features to evaluate their ability in discriminating finger extension from grasping movements executed by 17 healthy subjects. We also explored how the variation in the dimensionality of the feature domain would influence the different classifier performances. Results showed that, regardless of the model, few CMC features (up to 10) allow for a successful classification of two different movements type. Moreover, support vector machine classifier with linear kernel showed the best trade-off between performances and system usability (few electrodes). Thus, these results suggest that a hBCI based on brain-muscular interplay holds the potential to enable more informed neural plasticity and functional motor recovery after stroke. Furthermore, this CMC-based BCI could also allow for a more “natural control” (i.e., that resembling physiological control) of prosthetic devices.
|
|
15:45-17:30, Paper WeEP-12.7 | |
Selecting an Effective Amplitude Threshold for Neural Spike Detection |
|
Zhang, Zheng | Imperial College London |
Constandinou, Timothy | Imperial College of Science, Technology and Medicine |
Keywords: Neural signal processing, Brain-computer/machine interface
Abstract: This paper assesses and challenges whether commonly used methods for defining amplitude thresholds for spike detection are optimal. This is achieved through empirical testing of single amplitude thresholds across multiple recordings of varying SNR levels. Our results suggest that the most widely used noise-statistics-driven threshold can suffer from parameter deviation in different noise levels. The spike-noise-driven threshold can be an ideal approach to set the threshold for spike detection, which suffers less from the parameter deviation and is robust to sub-optimal settings.
|
|
15:45-17:30, Paper WeEP-12.8 | |
Corticomuscular Connectivity During Walking in Able Bodied and Individuals with Incomplete Spinal Cord Injury |
|
Saleh, Soha | Kessler Foundation |
Glassen, Michael | Kessler Foundation |
Momeni, Kamyar | Kessler Foundation |
Ravi, Manikandan | Kessler Foundation |
Bheemreddy, Akhil | Kessler Foundation |
Hoxha, Armand | Kessler Foundation |
Garbarini, Erica | Kessler Foundation |
Yue, Guang | Kessler Foundation |
Forrest, Gail F | Kessler Foundation |
Keywords: Brain functional imaging - EEG, Human performance - Sensory-motor, Brain functional imaging - Connectivity and information flow
Abstract: This exploratory study used EEG as mobile imaging method to study cortico-muscular connectivity (CMC) during walking in able-bodied individuals (AB) and individuals with spinal cord injury (iSCI), while walking with and without exoskeleton walking robot (EWR) assistance. We also explored change in CMC after intensive training using EWR assistance in iSCI. Results showed no different in CMC within the AB group during walking with and without robot assistance. However, before training the iSCI subjects showed lower CMC during walking with robot assistance. The intensive 40 hours of walking training with EWR improved the walking function in iSCI participants allowing them to walk with robot assistance set to lower assistance level. This decrease in assistance level and improvement in walking function correlated with increase in CMC, reducing the difference in CMC during walking with and without EWR assistance. The findings suggest that high level of robot assistance and low walking function in iSCI correlates with weaker connectivity between primary motor cortices and lower extremity muscles. Further research is needed to better understand the importance of intention and cortical involvement in training of walking function using EWRs.
|
|
15:45-17:30, Paper WeEP-12.9 | |
Improved Grip Force Prediction Using a Loss Function That Penalizes Reward Related Neural Information |
|
ASOK KUMAR, JAGANTH NIVAS | University of Houston |
Francis, Joseph Thachil | University of Houston |
Keywords: Neural signals - Machine learning & Classification, Brain-computer/machine interface, Motor neuroprostheses - Prostheses
Abstract: Abstract—Neural activity in the sensorimotor cortices has been previously shown to correlate with kinematics, kinetics, and non-sensorimotor variables, such as reward. In this work, we compare the grip force offline Brain Machine Interface (BMI) prediction performance, of a simple artificial neural network (ANN), under two loss functions: the standard mean squared error (MSE) and a modified reward penalized mean squared error (RP_MSE), which penalizes for correlation between reward and grip force. Our results show that the ANN performs significantly better under the RP_MSE loss function in three brain regions: dorsal premotor cortex (PMd), primary motor cortex (M1) and the primary somatosensory cortex (S1) by approximately 6%.
|
|
15:45-17:30, Paper WeEP-12.10 | |
Electrophysiological Correlates of Response Time in a Vigilant Attention Task |
|
Toth, Jake | University of Sheffield |
Patel, Ricken | University of Sheffield |
Arvaneh, Mahnaz | University of Sheffield |
Keywords: Human performance - Attention and vigilance, Brain functional imaging - EEG, Brain-computer/machine interface
Abstract: Early detection of a deficit in vigilant attention can allow for user notification or intervention. In this paper, Electrophysiological correlates of vigilant attention from a random-dot motion task were explored. Using only frontal (Fz) and parietal (Pz) EEG channels, spectral features of response time were determined. Notably, significant differences in high beta, gamma and alpha frequency bands were found between fast and slow reaction times. These results are interpreted in line with the relevant literature on arousal, off-task thought and active visuospatial attentional suppression. The presence of response-locked time-domain features was analysed. However, motor-related features obfuscated these features.
|
|
WeEP-13 |
Hall 5 |
Theme 06. Stimulation of Neural Tissues |
Poster Session |
Chair: Purcell, Erin | Michigan State University |
|
15:45-17:30, Paper WeEP-13.1 | |
Non-Invasive Stable Sensory Feedback for Closed-Loop Control of Hand Prosthesis |
|
Zhang, Jie | Shanghai Jiao Tong University |
Chou, Chih-Hong | Shanghai Jiao Tong University |
Wu, Xiaoting | Institute of Semiconductors, CAS |
Pei, Weihua | Institute of Semiconductors, CAS |
Lan, Ning | Shanghai Jiao Tong University |
Keywords: Sensory neuroprostheses, Neurorehabilitation, Neural stimulation
Abstract: The absence of somatotopic sensory feedback limits the function of conventional prosthetic hands. In this study, we integrated a non-invasive sensory feedback system into a commercial Bebionic hand with new customized surface stimulation electrodes. Multiple modalities of tactile and hand aperture sensory information were conveyed to the amputee via the technique of evoked tactile sensation (ETS) elicited at projected finger map (PFM) of residual limb and an additional electrotactile stimulation in the ipsilateral upper arm. A previously developed anti-stimulus artifact module was used to remove the stimulus artifact from surface electromyographic (sEMG) signals, and the filtered sEMG envelops controlled the speed of open/close of the Bebionic hand. The Ag/AgCl surface stimulation electrode in 10-mm diameter was specially designed to fit the restricted PFM areas for stable perception. We evaluated the alternating-current (AC) impedance magnitude of this electrode stimulated over 12 hours. The perceptual and upper thresholds in pulse-width over 200 days at PFM areas were recorded to assess the stability of the non-invasive sensory neural interface. The efficacy of multi-modality feedback for identification of physical properties of objects was also assessed. Results showed that the AC impedance of customized surface stimulation electrode was stable over 12 hours of stimulation. The perceptual and upper thresholds were stable over 200 days. This non-invasive sensory feedback enabled a forearm amputee to identify the compliance and length of grasped objects with an accuracy of 100 %. Results illustrated that the multi-modality sensory feedback system provided stable and sufficient sensory information for perceptual discrimination of physical features of grasped objects.
|
|
15:45-17:30, Paper WeEP-13.2 | |
Atomic Force Microscope Characterization of the Bending Stiffness and Surface Topography of Silicon and Polymeric Electrodes |
|
Riggins, Ti'Air | Michigan State University |
Li, Wen | Michigan State University |
Purcell, Erin | Michigan State University |
Keywords: Neural interfaces - Tissue-electrode interface, Neural interfaces - Biomaterials, Neural interfaces - Bioelectric sensors
Abstract: Implanted electrodes in the brain are increasingly used in research and clinical settings to understand and treat neurological conditions. However, a foreign body response typically occurs after implantation, and glial encapsulation of the device is a commonly observed. Multiple factors affect how gliosis surrounding the implantable electrodes evolves. Characterizing and measuring the surface features and mechanical properties of these devices may allow us to predict where gliosis will occur, and understanding how electrode design features may impact astrogliosis may give researchers a set of design guidelines to follow to maximize chronic performance. In this study, we used atomic force microscopy to measure surface roughness on parylene, polyimide, and silicon devices. Multiple features on microelectrode arrays were measured, including electrode sites, traces, and the bulk substrate. We found differences in surface roughness according to device material, but not device features. We also directly measured the bending stiffness of silicon devices, providing a more exact quantification of this property to corroborate calculated estimates.
|
|
15:45-17:30, Paper WeEP-13.3 | |
Calcium Activation of Parvalbumin Neurons Induced by Electrical Motor Cortex Stimulation |
|
Wang, Ruixue | Qiushi Academy for Advanced Studies, Zhejiang Universtity, |
Han, Jiawei | Zhejiang University |
Wang, Xi | Zhejiang University Interdisciplinary Institute of Neuroscience |
Xu, Yuhang | Coventry University |
Zheng, Dingchang | Coventry University |
You, Heecheon | Pohang University of Science and Technology |
Zhang, Shaomin | Zhejiang University |
Keywords: Brain-computer/machine interface, Neural stimulation
Abstract: Electrical motor cortex stimulation (EMCS) has been used for Parkinson’s Disease (PD) treatment. Some studies found that distinct cell types might lead to selective effects. As the largest subgroup of interneurons, Parvalbumin (PV) neurons have been reported to be involved in the mechanisms of therapeutic efficacy for PD treatment. However, little is known about their responses to the EMCS. In this study, we used in-vivo two-photon imaging to record calcium activities of PV neurons (specific type) and all neurons (non-specific type) in layer 2/3 primary motor cortex (M1) during EMCS with various stimulus parameters. We found PV neurons displayed different profiles of activation property compared to all neurons. The cathodal polarity preference of PV neurons decreased at a high-frequency stimulus. The calcium transients of PV neurons generated by EMCS trended to be with large amplitude and short active duration. The optimal activation frequency of PV neurons is higher than that of all neurons. These results improved our understanding of the selective effects of EMCS on specific cell types, which could bring more effective stimulation protocols for PD treatment.
|
|
15:45-17:30, Paper WeEP-13.4 | |
How the Number and Distance of Electrodes Change the Induced Electric Field in the Cortex During Multichannel TDCS |
|
Santos Videira, Andreia | Instituto De Biofísica E Engenharia Biomédica, Faculdade De Ciên |
Sózinho, Diogo Canadas | Instituto De Biofísica E Engenharia Biomédica, Faculdade De Ciên |
de Oliveira Pires, Leonor | Instituto De Biofísica E Engenharia Biomédica, Faculdade De Ciên |
Andrade, Alexandre | Instituto De Biofísica E Engenharia Biomédica, Faculdade De Ciên |
Ferreira, Hugo | Institute of Biophysics and Biomedical Engineering |
Miranda, Pedro Cavaleiro | Faculdade De Ciências, Universidade De Lisboa |
Fernandes, Sofia Rita | Faculdade De Ciências E Faculdade De Medicina Da Universidade De |
Keywords: Neural stimulation, Neuromuscular systems - Computational modeling, Brain physiology and modeling - Neuron modeling and simulation
Abstract: Multichannel transcranial direct current stimulation (tDCS) is a promising approach to target neuromodulation of neural networks by making use of variable number of electrodes and distances to facilitate/inhibit specific connectivity patterns. Optimization of the electric field (EF) spatial distribution through computational models can provide a more accurate definition of the stimulation settings that are more effective. In this study, we investigate the effect of increasing the number of cathodes around a central anode placed over the target. We demonstrate that anode-cathode distance has the largest influence in the EF and using more than 3 cathodes did not result in considerable changes in the EF magnitude and direction. This could be relevant for simultaneous tDCS-electroencephalography (EEG) applications, by saving electrode positions for EEG acquisition.
|
|
15:45-17:30, Paper WeEP-13.5 | |
An In-Vitro System for Closed Loop Neuromodulation of Peripheral Nerves |
|
Ribeiro, Mafalda | University of Bath |
Jabban, Leen | University of Bath |
Rettore Andreis, Felipe | Aalborg University |
Nørgaard Gomes dos Santos Nielsen, Thomas | Aalborg University |
Rocha, Paulo R. F. | Universidade De Coimbra |
Metcalfe, Benjamin William | University of Bath |
Keywords: Neural interfaces - Implantable systems, Neural signal processing, Neural stimulation
Abstract: Current neuromodulation research relies heavily on in-vivo animal experiments for developing novel devices and paradigms, which can be costly, time-consuming, and ethically contentious. As an alternative to this, in-vitro systems are being developed for examining explanted tissue in a controlled environment. However, these systems are typically tailored for cellular studies. Thus, this paper describes the development of an in-vitro system for electrically recording and stimulating large animal nerves. This is demonstrated experimentally using explanted pig ulnar nerves, which show evoked compound action potentials (eCAPs) when stimulated. These eCAPs were examined both in the time and velocity domain at a baseline temperature of 20°C, and at temperatures increasing up to those seen in-vivo (37°C). The results highlight that as the temperature is increased within the in-vitro system, faster conduction velocities (CVs) similar to those present in-vivo can be observed. To our knowledge, this is the first time an in-vitro peripheral nerve system has been validated against in-vivo data, which is crucial for promoting more widespread adoption of such systems for the optimisation of neural interfaces.
|
|
15:45-17:30, Paper WeEP-13.6 | |
Influence of Temporal Interference Stimulation Parameters on Point Neuron Excitability |
|
Plovie, Tom | Ghent University |
Schoeters, Ruben | Ghent University |
Tarnaud, Thomas | Ghent University/IMEC |
Martens, Luc | IMinds / Ghent University |
Joseph, Wout | Ghent University |
Tanghe, Emmeric | Ghent University |
Keywords: Neural stimulation - Deep brain, Neural signals - Nonlinear analysis, Neurological disorders
Abstract: Temporal interference (TI) stimulation is a technique in which two high frequency sinusoidal electric fields, oscillating at a slightly different frequency are sent into the brain. The goal is to achieve stimulation at the place where both fields interfere. This study uses a simplified version of the Hodgkin-Huxley model to analyse the different parameters of the TI-waveform and how the neuron reacts to this waveform. In this manner, the underlying mechanism of the reaction of the neuron to a TI-signal is investigated.
|
|
15:45-17:30, Paper WeEP-13.7 | |
Quantifying the Influence of Stimulation Protocols on Neural Network Connectivity Inference to Optimize Rapid Network Measurements |
|
Ouchi, Tomohiro | University of Washington |
Orsborn, Amy | University of Washington |
Keywords: Neural stimulation, Brain physiology and modeling - Neural circuits, Neural signals - Machine learning & Classification
Abstract: Connectivity is key to understanding neural circuit computations. However, estimating in vivo connectivity using recording of activity alone is challenging. Issues include common input and bias errors in inference, and limited temporal resolution due to large data requirements. Perturbations (e.g. stimulation) can improve inference accuracy and accelerate estimation. However, optimal stimulation protocols for rapid network estimation are not yet established. Here, we use neural network simulations to identify stimulation protocols that minimize connectivity inference errors when using generalized linear model inference. We find that stimulation parameters that balance excitatory and inhibitory activity minimize inference error. We also show that pairing optimized stimulation with adaptive protocols that choose neurons to stimulate via Bayesian inference may ultimately enable rapid network inference.
|
|
15:45-17:30, Paper WeEP-13.8 | |
Spinal Cord Transcutaneous Stimulation Enables Volitional Knee Extension in Motor-Complete SCI |
|
Momeni, Kamyar | Kessler Foundation |
Pilkar, Rakesh | Kessler Foundation |
Ravi, Manikandan | Kessler Foundation |
Bheemreddy, Akhil | Kessler Foundation |
Garbarini, Erica | Kessler Foundation |
Forrest, Gail F | Kessler Foundation |
Keywords: Neural stimulation, Neurorehabilitation, Neuromuscular systems - EMG processing and applications
Abstract: Non-invasive spinal cord transcutaneous stimulation (scTS) is often applied to one or multiple spinal segments and may improve motor control after spinal cord injury (SCI). The purpose of this pilot study was to apply tonic scTS to an individual with motor-complete spinal cord injury (SCI) in order to initiate and maintain volitional control during a specific lower-extremity motor task. The participant’s legs were placed in a gravity-neutral position, and he was asked to extend his knee, with and without the presence of tonic scTS. Our results show intentional voluntary control of knee extension with scTS (with no assistance). Our preliminary findings highlight how scTS neuromodulation of the spinal circuitry has the potential to restore motor function for people with motor-complete SCI.
|
|
15:45-17:30, Paper WeEP-13.9 | |
Standardization of Stimulus Location for Functional Electrical Stimulation of Swallowing |
|
Koike, Yuki | Tokyo University of Science |
Hashimoto, Takuya | Tokyo University of Science |
Kikuchi, Takahiro | Musashino Red Cross Hospital |
MICHIWAKI, Yukihiro | Musashino Red Cross Hospital |
Keywords: Motor neuroprostheses - Neuromuscular stimulation, Neurorehabilitation, Neurological disorders
Abstract: Many nerves and muscles are involved in the swallowing process; hence neuromuscular disorders cause dysphagia resulting in aspiration pneumonia. A critical movement in the pharyngeal phase of swallowing is hyolaryngeal elevation to help protect the airway and open a relaxed upper esophageal sphincter. Functional electrical stimulation (FES) is expected to improve the function of muscles acting on the hyolaryngeal motion, which may contribute to airway protection of dysphagic patients. However, it is difficult to select the stimulus locations that effectively assist laryngeal elevation without the expertise in the anatomy of swallowing-related muscles. Therefore, this study investigated the method to standardize the selection of the stimulus locations based on the dimensions of the larynx. In addition, the effect of stimulus intensity on the amount of laryngeal elevation was evaluated.
|
|
15:45-17:30, Paper WeEP-13.10 | |
Selective Neuromodulation of Retinal Ganglion Cells Via a Hybrid Optic-Nerve and Retinal Neuroprosthesis for Visual Restoration |
|
Pratiwi, Ariastity Mega | University of Sydney |
Kékesi, Orsolya | University of Sydney |
Suaning, Gregg | The University of Sydney |
Keywords: Brain physiology and modeling - Neuron modeling and simulation, Sensory neuroprostheses - Visual, Neural stimulation
Abstract: A visual neuroprosthesis delivers electrical stimulation to the surviving neural cells of the visual pathway to produce prosthetic vision. While the retina is often chosen as the stimulation site, current retinal prostheses are hindered by the lack of functional selectivity that impairs the resolution. A possible strategy to improve the resolution is to combine the retinal stimulation and the stimulation of the optic nerve bundle, which contains myelinated fibres of retinal ganglion cells (RGCs)’ axons that vary in diameter. In this study, we used a computational model of retinal ganglion cells (RGCs) with a myelinated axon to predict whether the frequency of electrical stimulation delivered to the optic nerve can be modulated to preferentially inhibit a subset of optic nerve fibres classified by diameter. The model combined a finite element model of bipolar penetrating electrodes delivering sinusoidal stimulation in the range of 25-10000 Hz to the optic nerve, and a double-cable model, to represent an optic nerve fibre. We found that the diameter of the axon fibre and ion kinetic properties of the RGC affect the neuron’s frequency response, demonstrating the potential of an optic nerve stimulation to produce selective inhibition based on the axon fibre size.
|
|
15:45-17:30, Paper WeEP-13.11 | |
A Transcutaneous Electrical Stimulation Method for Sensory Substitution of Wrist Extension-Flexion: A Preliminary Study "Nomination for Yichen Han" (withdrawn from program) |
|
Yichen, Han | Chongqing University |
zuo, yufeng | Chongqing University |
胡, 慧敏 | 重庆大学 |
Song, Hongliang | Chongqing University |
Zhou, yi | Chongqing University |
Yinping, Lu | Chongqing University |
LEI, LI | Chongqing Southwest Hospital |
xing, wang | Chongqing University |
|
15:45-17:30, Paper WeEP-13.12 | |
High-Frequency rTMS Combined with Task-Specific Hand Motor Training Modulates Corticospinal Plasticity in Motor Complete Spinal Cord Injury: A Case Report |
|
Brihmat, Nabila | Kessler Foundation |
Bayram, Mehmed Bugrahan | Kessler Foundation |
Allexandre, Didier | Kessler Foundation |
Saleh, Soha | Kessler Foundation |
Guan, Xiaofei | Burke Neurological Institute, Feil Family Brain and Mind Researc |
Yue, Guang | Kessler Foundation |
Zhong, Jian | Burke Neurological Institute, Feil Family Brain and Mind Researc |
Forrest, Gail F | Kessler Foundation |
Keywords: Neural stimulation, Neuromuscular systems - Central mechanisms, Neurorehabilitation
Abstract: Since its first use in spinal cord injury (SCI) in the early 2000s, high-frequency repetitive transcranial magnetic stimulation (HF-rTMS) demonstrated a capacity to modulate corticospinal excitability (CSE) and motor performance. Studies focused on individuals with incomplete SCI. Here, we examined the feasibility of a 15-day therapeutic stimulation protocol combining HF-rTMS with task-specific motor training targeting the weaker hand in an individual with early chronic complete SCI. In this case report, we present evidence of progressive increase of CSE at rest and during muscle activation, and decreased cortical inhibition, associated with a trend toward improvement in pinch function of the weaker hand. These promising findings need to be confirmed in a larger population. Clinical Relevance— These preliminary results are promising and demonstrate the importance of a large number of training session repetitions to induce consistent changes relevant to the recovery after a complete SCI.
|
|
WeEP-14 |
Hall 5 |
Theme 07. Human Movement Sensing P1 |
Poster Session |
Chair: Noury, Norbert | University Lyon 1 |
|
15:45-17:30, Paper WeEP-14.1 | |
Predicting Risk of Falls in Elderly Using a Single Inertial Measurement Unit on the Lower-Back by Estimating Spatio-Temporal Gait Parameters |
|
Aqueveque, Pablo | Universidad De Concepcion |
Gomez, Britam | University of Concepcion |
Ortega, Paulina | Universidad De Concepción |
Peña, Guisella | Universidad De Concepción |
Retamal, Gustavo | Universidad De Concepción |
Cano, Roberto | Universidad Rey Juan Carlos |
Keywords: Modeling and analysis, Novel methods, Sensor systems and Instrumentation
Abstract: One of the consequences of aging is the increased risk of falls, especially when someone walks in unknown or uncontrolled environments. Usually, gait is evaluated through observation and clinical assessment scales to identify the state and deterioration of the patient's postural control. Lately, technological systems for bio-mechanical analysis have been used to determine abnormal gait states being expensive, difficult to use, and impossible to apply in real conditions. In this article, we explore the ability of a system based on a single inertial measurement unit located in the lower back to estimate spatio--temporal gait parameters by analyzing the signals available in the Physionet repository "Long Term Movement Monitoring Database" which, together with automatic classification algorithms, allow predicting the risk of falls in the elderly population. Different classification algorithms were trained and evaluated, being the Support Vector Machine classifier with a third-degree polynomial kernel, cost function C = 2 with the best performance, with an Accuracy = 59%, Recall = 91%, and F1- score = 71%, providing promising results regarding a proposal for the quantitative, online and realistic evaluation of gait during activities of daily living, which is where falls actually occur in the target population.
|
|
15:45-17:30, Paper WeEP-14.2 | |
Magnetometer-Free Kalman Filter for Motor-Based Assessment of Prodromal Parkinson’s Disease |
|
Guaitolini, Michelangelo | The BioRobotics Institute, Scuola Superiore Sant'Anna, 56127, Pi |
Rovini, Erika | University of Florence |
Galperti, Guenda | Scuola Superiore Sant'Anna, the BioRobotics Institute |
Fiorini, Laura | University of Florence |
Cavallo, Filippo | University of Florence |
Keywords: Health monitoring applications, Sensor systems and Instrumentation, Wearable wireless sensors, motes and systems
Abstract: Observing the kinematics of specific motor tasks, such as finger tapping (FT), provides an objective and consistent quantification of the severity of neurodegenerative diseases. However, the current clinical practice mostly relies on visual observations performed by the clinician. Thus, the assessment is subjective. In this paper, we propose a magnetometer-free Kalman filter (KF) to assess FT features using wearable, inertial sensors. The KF was used to assess features during two different FT tasks, namely forefinger tapping (FTAP) and thumb-forefinger tapping (THFF). The proposed KF was validated against a camera-based reference and compared with a strap-down integration-based method. Comparison between KF method and camera reference showed low discrepancies in terms of root mean square error (RMSE) for considered features: namely number of repetitions (RMSE < 0.7), tapping frequency (RMSE < 0.1 Hz), and amplitude (RMSE < 2.6 deg). An high correlation coefficient between amplitudes was also obtained. The proposed KF performed better than the strap-down integration method on both FT tasks, showing lower RMSE on every feature as well as a higher correlation coefficient. Clinical Relevance— The wearable setup, as well as the proposed magnetometer-free KF, may provide a low-cost, easy-to-use, non-invasive motion tracking system for protocols aimed to assess motor performances in neurodegenerative disorders.
|
|
15:45-17:30, Paper WeEP-14.3 | |
Are Gyroscopes an Added Value in Test-Subject-Independent Activity Recognition with IMUs? |
|
Shang, Meng | KU Leuven |
De Raedt, Walter | Imec |
Varon, Carolina | Université Libre De Bruxelles |
Vanrumste, Bart | Katholieke Universiteit Leuven |
Keywords: IoT sensors for health monitoring, Modeling and analysis
Abstract: Inertial sensors have played a key role in the development of Human Activity Recognition (HAR) systems. Adding gyroscopes in HAR systems leads to increased battery and processing resources. Therefore, it is important to explore their added value compared with using accelerometers only. This study evaluates the added value of gyroscopes in activity recognition. Two public available datasets recorded by accelerometers and gyroscopes were studied. These datasets focus on multiple types of activities: UCI HAR dataset includes walking, walking upstairs, walking downstairs, sitting, standing, laying and WISDM dataset includes 18 hand-oriented and non-hand-oriented activities. Several machine learning models were applied to both datasets for activity recognition. Leave-one-subject-out cross-validation (LOSO) was applied to evaluate the models, where the training set and test set were from different subjects. For UCI HAR dataset, the multilayer perceptron (MLP) model obtained the highest f1-scores. Adding a gyroscope on the waist significantly improved the f1-scores of sitting and laying (both p<0.05). For WISDM dataset, the support vector machines (SVM) model obtained the highest f1-scores. The gyroscope on the wrist improved hand-oriented activities while the gyroscope in the pockets improved non-hand-oriented activities (all p<0.05). The results showed the improvement for recognition performance by adding gyroscopes. However, the improvement was dependent on the type of activity and the mounting place of the gyroscope.
|
|
15:45-17:30, Paper WeEP-14.4 | |
3DKnITS: Three-Dimensional Digital Knitting of Intelligent Textile Sensor for Activity Recognition and Biomechanical Monitoring |
|
Wicaksono, Irmandy | Massachusetts Institute of Technology |
Hwang, Peter | Massachusetts Institute of Technology |
Droubi, Samir | Massachusetts Institute of Technology |
Wu, Franny Xi | Wellesley College |
Serio, Allison | Massachusetts Institute of Technology |
Yan, Wei | Massachusetts Institute of Technology |
Paradiso, Joseph | Massachusetts Institute of Technology |
Keywords: Textile-electronic integration, Smart textiles and clothings, Wearable body sensor networks and telemetric systems
Abstract: We present an approach to develop seamless and scalable piezo-resistive matrix-based intelligent textile using digital flat-bed and circular knitting machines. By combining and customizing functional and common yarns, we can design the aesthetics and architecture and engineer both the electrical and mechanical properties of a sensing textile. By incorporating a melting fiber, we propose a method to shape and personalize three-dimensional piezo-resistive fabric structure that can conform to the human body through thermoforming principles. It results in a robust textile structure and intimate interfacing, suppressing sensor drifts and maximizing accuracy while ensuring comfortability. This paper describes our textile design, fabrication approach, wireless hardware system, deep-learning enabled recognition methods, experimental results, and application scenarios. The digital knitting approach enables the fabrication of 2D to 3D pressure-sensitive textile interiors and wearables, including a 45 x 45 cm intelligent mat with 256 pressure-sensing pixels, and a circularly-knitted, form-fitted shoe with 96 sensing pixels across its 3D surface both with linear piezo-resistive sensitivity of 39.4 for up to 500 N load. Our personalized convolutional neural network models are able to classify 7 basic activities and exercises and 7 yoga poses in-real time with 99.6% and 98.7% accuracy respectively. Further, we demonstrate our technology for a variety of applications ranging from rehabilitation and sport science, to wearables and gaming interfaces.
|
|
15:45-17:30, Paper WeEP-14.5 | |
Analysis of Simple Algorithms for Motion Detection in Wearable Devices |
|
Carretero Pérez, Ana | B105 UPM |
Araujo, Alvaro | Universidad Politécnica De Madrid |
Keywords: Wearable sensor systems - User centered design and applications, Modeling and analysis
Abstract: Brain Computer Interfaces are used to obtain relevant information from the electroencephalogram (EEG) with a concrete objective. The evoked potentials related to movement are much demanded nowadays, in particular the ones associated to imagery movement. The objective of this work is to develop simple algorithms to imagery motion detection that can be included in a non-invasive wearable that everybody can use in a comfortable way for new services and applications. A wearable implies low resources, which is the most important requirement that the algorithms have. A public database with 105 subjects doing an upper-limb imagery movement is used. We have developed two algorithms (FBA and BLA) based on three characteristics of the signal (correlation, wavelet energy per segment and wavelet energy per electrode). They are tested for different number of electrodes and frequency bands. The best performance is found for 6 electrodes. The beta band is not the only band who achieves good performances. In fact, in this study the range between 25 Hz – 30 Hz has obtained the best performance using 6 electrodes. The conclusions show that these simple algorithms not fit well with the wearable requirements. However, it shows the need of adaptive algorithms to bypass the differences between subjects. Also, it affirms that more electrodes not lead to a better information, as well as, less electrodes not lead to a worse information. The same goes for frequency, where not only the beta band have the information required that fits our needs.
|
|
15:45-17:30, Paper WeEP-14.6 | |
Continual Learning for Activity Recognition |
|
Sah, Ramesh Kumar | Washington State University |
Mirzadeh, Seyed Iman | Washington State University |
Ghasemzadeh, Hassan | Arizona State University |
Keywords: Modeling and analysis, Novel methods, Health monitoring applications
Abstract: The recent success of deep neural networks in prediction tasks on wearable sensor data is evident. However, in more practical online learning scenarios, where new data arrive sequentially, neural networks suffer severely from the catastrophic forgetting problem. In real-world settings, given a pre-trained model on the old data, when we collect new data, it is practically infeasible to re-train the model on both old and new data because the computational costs will increase dramatically as more and more data arrive in time. However, if we fine-tune the model only with the new data because the new data might be different from the old data, the neural network parameters will change to fit the new data. As a result, the new parameters are no longer suitable for the old data. This phenomenon is known as catastrophic forgetting, and continual learning research aims to overcome this problem with minimal computational costs. While most of the continual learning research focuses on computer vision tasks, implications of catastrophic forgetting in wearable computing research and potential avenues to address this problem have remained unexplored. To address this knowledge gap, we study continual learning for activity recognition using wearable sensor data. We show that the catastrophic forgetting problem is a critical challenge for the real-world deployment of machine learning models for wearable sensor data. Moreover, we show that the catastrophic forgetting problem can be alleviated by employing various training techniques.
|
|
15:45-17:30, Paper WeEP-14.7 | |
Automatic Detection of Falling of the Elderly Subject among His Daily Activities |
|
Noury, Norbert | University Lyon 1 |
Keywords: Mechanical sensors and systems, Wearable low power, wireless sensing methods
Abstract: Most elderly patients after falling, being not able to rise up or call for help, are particularly at risk of complication. This urges for the development of autonomous devices for earliest detection of falls. This paper is an overview of the current design approaches to autonomous fall detectors – sensors and algorithms- and a methodology to assess their efficiency. We then propose our fall sensor, which features high sensitivity (95%) and specificity (99%) on simulated falls in lab settings, and lower sensitivity (62.5%) in real settings in a group of 10 patients, with 8 falls detected over a period of 28 days.
|
|
15:45-17:30, Paper WeEP-14.8 | |
Detection of Epileptic Seizure Using Accelerometer Time Series Data and Hidden Markov Model |
|
Agrahri, Anshuman | Department of Electronics & Communications Engineering, Indian I |
Tyagi, Alok | Department of Electronics & Communications Engineering, Indian I |
Kumar, Dheeraj | Indian Institute of Technology (IIT) Roorkee |
Kusumakar, Shitanshu | Department of Electrical & Electronic Engineering, the Universit |
Palaniswami, Marimuthu | The University of Melbourne |
Yan, Bernard | The Royal Melbourne Hospital |
Keywords: IoT sensors for health monitoring, Modeling and analysis, Wearable low power, wireless sensing methods
Abstract: Epilepsy is one of the most prevalent neurological diseases globally, which causes seizures in the patient. As per a survey done worldwide, it is found that approximately 70 million people are living with epilepsy (~ 1% of the total population of the world). Effective detection of these seizures requires specialized approaches such as video and electroencephalography monitoring, which are expensive and are mainly available at specialized hospitals and institutes. Hence, there is a need to develop simpler and affordable systems that can be made available to health care centers and patients for accurate detection of epileptic seizures. A wireless remote monitoring system based on a wrist-worn accelerometer is an optimum choice for the same. Sophisticated algorithms need to be developed for effectively detecting seizure events from this accelerometer data with minimal false alarms. This paper presents a Hidden Markov Model (HMM) based probabilistic approach applied to the reduced-dimension feature vector representation of time-series accelerometer data to detect epileptic seizures. The results obtained from the HMM were compared with three commonly used machine learning models viz. support vector machine (SVM), logistic regression, and random forest. The proposed approach was able to detect 95.7% of seizures with a low false alarm rate of 14.8% with a run time of just under 24 seconds.
|
|
15:45-17:30, Paper WeEP-14.9 | |
Sleep Posture Detection Using an Accelerometer Placed on the Neck |
|
Abdulsadig, Rawan S. | Imperial College London |
Singh, Sukhpreet | Imperial College |
Patel, Zaibaa | Imperial College London |
Rodriguez-Villegas, Esther | Imperial College London |
Keywords: Physiological monitoring - Modeling and analysis, Modeling and analysis, Wearable low power, wireless sensing methods
Abstract: Sleep position monitoring is key when attempting to address posture triggered sleep disorders. Many studies have explored sleep posture detection from a dedicated physical sensing channel exploiting optimum body locations, such as the torso; or alternatively non-contact approaches. But, little work has been done to try to detect sleep position from a body location which, whilst being suboptimal for that purpose, does however allow for better extraction of more critical biomarkers from other sensing modalities, making possible multi-modal monitoring in certain clinical applications. This work presents two different approaches, at varying levels of complexity, for detecting 4 main sleep positions (supine, prone, lateral right and lateral left) from accelerometry data obtained by a single wearable device placed on the neck. An ultra light-weight threshold-based model is presented in this work, in addition to an Extra-Trees classifier. The threshold-based model was able to achieve 95% average accuracy and 0.89 F1-score on out-of-sample data, showing that it is possible to obtain a moderately high classification performance using a simple rule-based model. The Extra-Trees classifier, on the other hand, was able to achieve 99% average accuracy and 0.99 average F1score using only 25 base estimators with maximum depth of 20. Both models show promise in detecting sleep posture with high accuracy when collecting the signals from a neck-worn accelerometer sensor.
|
|
15:45-17:30, Paper WeEP-14.10 | |
Human Activity Recognition from Textile Electrocardiograms |
|
Klingenberg, Arne | TU Braunschweig |
Purrucker, Valentin | Peter L. Reichertz Institute for Medical Informatics |
Schüler, Willi | Peter L. Reichertz Institute for Medical Informatics |
Ganapathy, Nagarajan | Indian Institute of Technology Madras |
Spicher, Nicolai | TU Braunschweig |
Deserno, Thomas | TU Braunschweig |
Keywords: Modeling and analysis, Smart textiles and clothings, Sensor systems and Instrumentation
Abstract: Textile sensors for physiological signals bear the potential of unobtrusive and continuous application in daily life. Recently, textile electrocardiography (ECG) sensors became available which are of particular interest for physical activity monitoring due to the high effect of exercise on the heart rate. In this work, we evaluate the effectiveness of a single-lead ECG signal acquired using a non-medical-grade ECG shirt for human activity recognition (HAR). Healthy volunteers (N=10) wore the shirt during four different activities (sleeping, sitting, walking, running) in an uncontrolled environment and ECG data (256 Hz, 12 Bit) was stored, manually checked, and unusable segments (e.g. no sensor contact) were removed, resulting in a total of 228 hours of recording. Signals were split in short segments of different duration (10, 30, 60s), transformed using the Short-time Fourier Transform (STFT) to a spectrogram image and fed into a state-of-the-art convolutional neural network (CNN). The best configuration results in an F1-Score of 73% and an accuracy of 77% on the test set. Results with leave-one-subject-out cross-validation show F1-Scores ranging from 41% to 80%. Thus, a single-lead, wearable-generated ECG has an informative value for HAR to a certain extent. In future work, we aim at using more sensors of the smart shirt and sensor fusion.
|
|
WeEP-15 |
Hall 5 |
Theme 07. Mobile Sensors and Systems P1 |
Poster Session |
|
15:45-17:30, Paper WeEP-15.1 | |
VoiceCare: A Voice-Interactive Cognitive Assistant on a Smartwatch for Monitoring and Assisting Daily Healthcare Activities |
|
Samyoun, Sirat | Department of Computer Science, University of Virginia |
Stankovic, John | Univ of Virgnia |
Keywords: Health monitoring applications, Wearable sensor systems - User centered design and applications
Abstract: Following several health activities in daily life (e.g., medication/exercise plans, handwashing, physiological monitoring) properly often requires monitoring and assistance support. Although the emergent smartwatch and wearable technologies have opened great opportunities to monitor these activities in the wild, existing smartwatch-based systems do not interactively guide the user and also lacks comprehensiveness to provide knowledge related to this set of daily activities. To overcome these limitations of the state-of-the-art, we present VoiceCare, a wearable cognitive assistant on a smartwatch for daily life healthcare that interactively | |