Degraded Data in Nonprofit Homebrew Databases
This is a strong qualitative CHI paper because it turns a familiar complaint—bad data—into a more precise account of how nonprofit work actually produces and tolerates degraded records. The contribution is not a new system, but a careful empirical vocabulary and a useful reframing of what counts as a problem under constraint.
Axes Lens
Rare contribution shape, typical evidence profile. The point here is not a score. It is to show what kind of claim the paper makes, and whether the evidence pattern is unusual or baseline in this 268 -review set.
Contribution shape
- Knowledge form
- descriptive knowledge typical · 92/268
- Novelty type
- empirical finding typical · 68/268
- Abstraction level
- organization less common · 4/268
- Generalization target
- organizational context typical · 20/268
- Validation mode
- qualitative study typical · 63/268
Evidence profile
- Evidence strength
- strong typical · 158/268
- Claim alignment
- strong typical · 231/268
- Overclaim risk
- low typical · 53/268
Review Summary
This paper’s value is in how it repositions data quality as an organizationally situated outcome rather than a purely technical failure. The authors do not simply say that nonprofit databases are messy; they show, through interviews with 19 volunteer administrators across 14 U.S. nonprofits, that degraded data takes recurring forms and that those forms are produced by legitimate tradeoffs in time, money, technology, and relationships. That is a meaningful CHI contribution because it gives the field a more grounded way to talk about why “better data” is often not the dominant objective in practice. The five-way characterization— incomplete, out-of-date/out-of-sync, bulk/lack of fidelity, anecdotal/modest, and garbage—helps organize a space that is usually discussed only in vague terms. I also think the paper is careful in a way that strengthens it: it does not overstate harm, and it explicitly asks whether degraded data is even a problem when the alternative is more work to collect better data. That nuance matters. The main limitation is scope: this is a qualitative study of a specific organizational context, so the findings should be read as descriptive and interpretive rather than broadly generalizable. Still, within that scope, the paper is persuasive and likely to be useful for researchers thinking about data practices, nonprofit systems, and design that respects competing priorities instead of assuming data quality is always the top goal.
What Changed
Canon before
Prior CHI work has characterized messy information ecosystems and information work, but this paper shifts attention from system messiness itself to the downstream quality of the data those systems produce and sustain in nonprofit volunteer-management settings.
Departure from common sense
The paper argues that degraded data in nonprofit homebrew databases should not be read mainly as a technical defect to eliminate. Instead, it is presented as a predictable consequence of people pursuing other legitimate priorities—time, money, technology, and relationships—so data quality is often secondary to more urgent work.
Actual novelty
The paper’s novelty is a new empirical characterization of degraded data in nonprofit volunteer-management databases: it identifies five recurring genres— incomplete, out-of-date/out-of-sync, bulk/lack of fidelity, anecdotal/modest, and garbage—and ties them to the practical priorities that shape data work.
Evidence
The paper is grounded in an interview study of 19 volunteer administrators across 14 U.S. nonprofit organizations. It reports five recurring forms of degraded data and argues these arise from legitimate competing priorities rather than simple neglect. The discussion also qualifies the implications, noting that degraded data may not be a problem relative to the costs of collecting better data, while still limiting future uses.
“ 4 Five Genres of Degraded Data The information management ecosystems used in the work of volunteer administrators, even a decade after the initial study in this context, still reflect all of the idiosyncrasies of homebrew databases characterized in the original Void”
actual novelty · Abstract + Method/Data analysis categories + Results overview · confidence 0.80
“ (P6) While degraded data clearly forecloses many alternate forms of data work, it is not clear that degraded data is more of a “problem” than the alternative problems that would be created by the additional work to collect more or “better” data”
departure from common sense · Abstract + Discussion/Conclusion framing of causes and design stance · confidence 0.73
“ (P6) While degraded data clearly forecloses many alternate forms of data work, it is not clear that degraded data is more of a “problem” than the alternative problems that would be created by the additional work to collect more or “better” data”
limitation · Section 4.6 'Is Degraded Data a Problem?' · confidence 0.64
“ [ 49 ], we recruited informants from the same population of information workers: 19 individuals (14 female) from 14 nonprofit organizations in the United States who are responsible for managing some aspect of the organizations’ volunteer programs”
validation scope · Method: Informants · confidence 0.71
Limits
Method limits
The evidence comes from semi-structured interviews with volunteer administrators in U.S. nonprofits, so the findings are interpretive and context-specific rather than statistically generalizable. The paper’s categories and causal framing are grounded in participants’ accounts, not direct measurement of data quality.
Deployment limits
The design implications are most relevant to nonprofit information ecosystems where volunteer labor, limited resources, and homebrew databases shape data practices. The paper does not establish a universal remedy for degraded data, and any intervention must respect the legitimacy of competing organizational priorities.
Boundary conditions
The findings are bounded by nonprofit volunteer-management contexts and by situations where data collection competes with other work. The paper itself notes that degraded data may have little effect on existing practices, but can foreclose alternate future uses of the data.
Position in field
This is a CHI qualitative contribution that reframes data quality as an organizational and practical outcome of information work, rather than only a technical database problem. It extends prior work on messy information ecosystems by naming recurring degraded-data forms and connecting them to resource constraints and legitimate priorities.