Inspecting filtering and imputation outputs (xlsx files)
Code
## Inspect filtering and imputation summary tables
library(readxl)
path_filtered <- system.file(
"extdata",
"HaCaT_filtered_out_proteins.xlsx",
package = "ProteinBatcher"
)
path_imputed <- system.file(
"extdata",
"HaCaT_Imputation_step.xlsx",
package = "ProteinBatcher"
)
filtered_df <- readxl::read_xlsx(path_filtered)
imputed_df <- readxl::read_xlsx(path_imputed)
head(filtered_df)
## # A tibble: 6 × 3
## protein_id gene_name reason
## <chr> <chr> <chr>
## 1 MBLC1_HUMAN MBLAC1 "Failed per-condition non-missing threshold: >= 0.50 wi…
## 2 SH321_HUMAN SH3D21 "Failed per-condition non-missing threshold: >= 0.50 wi…
## 3 AN33B_HUMAN ANKRD33B "Failed per-condition non-missing threshold: >= 0.50 wi…
## 4 SFI1_HUMAN SFI1 "Failed per-condition non-missing threshold: >= 0.50 wi…
## 5 KLRF2_HUMAN KLRF2 "Failed per-condition non-missing threshold: >= 0.50 wi…
## 6 APRIO_HUMAN PRNP "Failed per-condition non-missing threshold: >= 0.50 wi…
Code
## # A tibble: 6 × 25
## gene NoTreated_1 `IL13 _1` IL22_1 `COMBO _1` NoTreated_2 `IL13 _2` IL22_2
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 NUDT4B 22.5 22.4 22.5 22.5 22.6 22.3 22.6
## 2 SMIM26 20.7 20.3 20.6 20.6 20.6 20.5 20.6
## 3 GATD3;GA… 24.5 24.4 24.3 24.6 24.6 24.5 24.4
## 4 PIGBOS1 21.4 21.4 21.1 21.6 21.6 21.6 20.9
## 5 NBDY 20.4 20.1 20.1 20.1 19.9 20.3 20.0
## 6 MMP24OS 19.4 19.3 18.9 19.4 19.7 19.1 18.8
## # ℹ 17 more variables: `COMBO _2` <dbl>, NoTreated_3 <dbl>, `IL13 _3` <dbl>,
## # IL22_3 <dbl>, `COMBO _3` <dbl>, NoTreated_4 <dbl>, `IL13 _4` <dbl>,
## # IL22_4 <dbl>, `COMBO _4` <dbl>, NoTreated_5 <dbl>, `IL13 _5` <dbl>,
## # IL22_5 <dbl>, `COMBO _5` <dbl>, NoTreated_6 <dbl>, `IL13 _6` <dbl>,
## # IL22_6 <dbl>, `COMBO _6` <dbl>
These Excel files are generated by the workflow to make the preprocessing steps fully transparent.
HaCaT_filtered_out_proteins.xlsx lists proteins removed during the missingness-based filtering step. This allows users to verify which features were excluded and why (e.g. insufficient detection across conditions).
HaCaT_Imputation_step.xlsx summarizes the imputation strategy applied to each protein and sample, distinguishing mean-imputed values from LDV (left‑censored) imputations.
Together, these tables allow direct comparison of the dataset before and after filtering and imputation, which is critical for quality control and interpretability in label‑free proteomics analyses.
Note: For package size reasons, this vignette uses a reduced example of the imputation summary table. The full table is generated during a standard workflow run.
Differential effects - IL13 vs NoTreated contrast
From this point onwards, we focus exclusively on the IL13 vs NoTreated contrast to illustrate how different effect types are interpreted.
Main effect (all proteins)

The Main Effect (All Proteins) volcano plot shows the global IL13 effect in HaCaT cell lines, ignoring any interaction terms (batch). Every tested protein is included, regardless of whether its response depends on batch.
How to read it:
- The x‑axis represents the estimated log2 fold change (IL13 vs NoTreated).
- The y‑axis represents statistical significance (adjusted p‑value).
This plot answers the question: “Which proteins are affected by IL13 on average?”
This view is useful for an initial overview, but it does not distinguish stable effects from batch‑dependent ones.
Common effect (batch‑independent proteins)

The Common Effect plot shows proteins whose IL13 response is not significantly modulated by batch. These proteins pass the interaction test and are therefore considered stable across batch levels.
How to read it:
- Proteins here show a consistent IL13 effect independent of interaction.
- These are typically the most robust candidates for biological interpretation, as their direction and magnitude are reproducible across experimental batches
Conceptually, this plot answers: “Which IL13‑responsive proteins behave consistently across batches?”
Interaction effect (batch‑dependent proteins)

The Interaction Effect plot highlights proteins whose IL13 response differs between batch levels (here: day1 vs day2).
How to read it:
- Significance is driven by the interaction term, not by the average IL13 effect.
- A protein may appear weak or even non-significant in the main effect plot but still show a strong interaction if its response reverses or changes magnitude between batches.
This plot answers: “Which proteins show batch-specific IL13 effects?”
Deregulogram: interpreting batch-dependent effects

The deregulogram compares full IL13 effects between the two batch levels.
- y-axis: full IL13 effect in the reference batch (e.g. day1)
- x-axis: full IL13 effect in the other batch (day2), computed as main effect + interaction effect
How to read it:
- Proteins along the diagonal behave similarly in both batches.
- Proteins deviating from the diagonal show batch-dependent regulation.
- Highlighted points represent proteins for which the interaction is both statistically significant and biologically relevant (effect size threshold).
Key interpretation note: The deregulogram is intentionally more restrictive than the interaction volcano. It is designed to emphasize interpretable, direction-changing or magnitude-shifting effects, rather than listing all statistically significant interactions. As a result, the set of highlighted proteins is not expected to match one-to-one with the interaction volcano, but be more stringent.