2022 · ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 561-565 · Field expert review · confidence high

Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals

Xingyu Chen, Qiushi Zhu, Jie Zhang, Li-Rong Dai

DOI arXiv

Sound classification paper, not SSI.

Verdict: full-text draftPriority: lowConfidence: highBasis: full textCoverage: high

Reading guidance

Verdict: full-text draft · priority low · confidence high
Why it matters: The full text supports a decent challenge entry, but it is plainly outside SSI: model ensemble reaches 86.72 AUC on Track-1 test and the paper headline reports 88.44% AUC on the blind fusion track.
What to trust: Basis: full text. Coverage: high. 3 evidence records back the review.
What is weak: It is a medical audio classifier and offers no SSI relevance beyond generic acoustic modeling. The evidence is challenge-specific and constrained by the DiCOVA data regime. No SSI deployment path exists in the paper. COVID-19 detection from respiratory audio only. Overclaim risk: Any framing as SSI progress would be unsupported..
Read before: SSI review rubric
Read next: SSI archive

Axes

Task: audio-classification
Modality: acoustic breathing, cough, and speech signals
Output: labels
Vocabulary: binary diagnosis
Metrics: ROC-AUC on validation and blind test sets
Evaluation mode: 5-fold cross-validation and blind-test benchmark reporting on the DiCOVA-ICASSP 2022 challenge
Review confidence: high
Overclaim risk: Any framing as SSI progress would be unsupported.

Expert take

The paper is honest about what it is. Section 3.1 ties the experiments to the DiCOVA 2022 challenge data, and Table 1 shows the supervised and self-supervised pretraining variants improving over the listed baselines on Track-1 breathing, with the ensemble reaching 86.72 test AUC and 80.05 validation AUC. The abstract then gives the stronger fusion-track headline number of 88.44% blind-test AUC. That is respectable benchmark engineering, but there is no silent-speech sensing, articulation modeling, or communication interface angle here.

True value

The full text supports a decent challenge entry, but it is plainly outside SSI: model ensemble reaches 86.72 AUC on Track-1 test and the paper headline reports 88.44% AUC on the blind fusion track.

What changed

Canon before

Low-resource acoustic diagnosis tasks often relied on hand-crafted features or single-model training recipes.

Delta from canon

The paper frames the main gain as better pretraining and ensemble strategy, not a new SSI or speech-interface method.

Position in field

Respiratory-audio classification paper that should remain marked as out of scope for SSI.

Evidence

“ SUPERVISED AND SELF-SUPERVISED PRETRAINING BASED COVID-19 DETECTION USING ACOUSTIC BREATHING/COUGH/SPEECH SIGNALS ”

author_claim · ABSTRACT · confidence 0.99

“ Results The proposed model is evaluated on the DiCOVA-ICASSP 2022 challenge dataset [15], which is derived from the crowd-sourced In experiments, we use 5-fold cross-validation to evaluate our su- Coswara dataset [23] and collected from volunteers with different pervised pre-training method, self-supervised pre-training method 22.5 breath_n 22.5 breath_p Table 1: The AUC score of different methods on test/validation sets. ”

validation_scope · 3.1. Datasets · confidence 0.98

“ Results The proposed model is evaluated on the DiCOVA-ICASSP 2022 challenge dataset [15], which is derived from the crowd-sourced In experiments, we use 5-fold cross-validation to evaluate our su- Coswara dataset [23] and collected from volunteers with different pervised pre-training method, self-supervised pre-training method 22.5 breath_n 22.5 breath_p Table 1: The AUC score of different methods on test/validation sets. ”

metric · Table 1: The AUC score of different methods on test/validation sets. · confidence 0.99

Limits

Technical limits

It is a medical audio classifier and offers no SSI relevance beyond generic acoustic modeling.

Evaluation limits

The evidence is challenge-specific and constrained by the DiCOVA data regime.

Deployment limits

No SSI deployment path exists in the paper.

Scope limits

COVID-19 detection from respiratory audio only.