SBM Lab

시스템 생물학 & 바이오인포매틱스 블로그. 프로테오믹스, 바이오마커, 건강과학에 대한 연구 기반 콘텐츠를 제공합니다.

벤 다이어그램 생성기

2~4개 세트를 비교하고 PNG/SVG로 다운로드하세요. 유전자 비교, DEG overlap, 데이터 분석에 활용 가능합니다.

더 보기

limma vs DEqMS for Proteomics — When to Use Which (n=3 to n=20+ Comparison)

Both limma and DEqMS provide moderated t-statistics for proteomics differential expression. limma was built for microarrays; DEqMS adds a proteomics-specific variance prior tied to peptide count. This guide compares them on small-n (n=3) and larger (n=10, n=20) proteomics designs with real recommendations on when DEqMS's extra step is worth it.

5/27

Imputing Missing Values in Proteomics — knn vs minDet vs MNAR — What Actually Works

Proteomics datasets are full of NAs, and how you handle them can flip your DEP list. This guide compares the four real-world options — leave NAs alone, k-Nearest Neighbors (knn), minimum detected (minDet), and explicit MNAR (Missing Not At Random) imputation — with practical recommendations for DIA-NN and MaxQuant outputs.

5/27

BioMart Pig ↔ Mouse 1:1 Ortholog Mapping for Cross-Species Proteomics (R + Python Tutorial)

Step-by-step tutorial: download the Ensembl BioMart Release 111 Sus scrofa ↔ Mus musculus 1:1 ortholog table and join it to your per-species protein quantification in R (biomaRt) and Python (pandas). Why gene-symbol matching is fragile, how to handle one-to-many and many-to-many orthologs, and how to validate your mapping against UniProt.

5/23

Why You Must NOT Merge Species FASTA Databases in Cross-Species Proteomics (Shared Peptide Problem)

In cross-species LC-MS/MS proteomics (Porcine ECM vs Mouse Matrigel, human vs mouse cells, host-pathogen, etc.) merging species FASTA into one search database breaks protein quantification. This is the shared-peptide ambiguity problem. Here's why it happens, how to set up species-separated searches in MaxQuant and DIA-NN, and how to handle peptides that genuinely match both species.

5/23

Reanalyzing PRIDE PXD023694 — Matrigel Nuclear Contaminants (EWSR1, RUVBL2) You'll Find in Real Data

A practical reanalysis of PRIDE dataset PXD023694 (Cho lab cross-species ECM) — what proteins actually come out of mouse Matrigel vs porcine ECM, the basement membrane signature (LAMB1/NID1/LAMC1/HSPG2) that replicates Park et al. 2026, and the contamination patterns (Matrigel nuclear proteins EWSR1/RUVBL2, porcine smooth muscle remnants TPM4/MYL9) that simulations miss but real data shows.

5/23

From DIA-NN Output to Paper Draft: A Complete AI-Assisted Proteomics Workflow (2026)

An honest, end-to-end guide to using AI across the full DIA-NN proteomics pipeline — from report.tsv quantification and downstream statistics through figure interpretation, Discussion writing, and a manuscript draft. Includes real prompts, where AI fails (with caught examples), verification checklists, and 2026 journal disclosure requirements.

5/22

모든 글 보기 →

SBM Lab

벤 다이어그램 생성기

최신 글

Can an LLM Run an RNA-seq Analysis on Its Own? Building ARIA, a Decision-Aware Transcriptome Framework

A MOGONET-Style Multi-Omics Biomarker Pipeline: Why a Near-Random Graph Net Still Earns Its Place

FragPipe vs MaxQuant — 2026 Real Speed Benchmark (Same Data, Same Output Quality)

더 보기

limma vs DEqMS for Proteomics — When to Use Which (n=3 to n=20+ Comparison)

Imputing Missing Values in Proteomics — knn vs minDet vs MNAR — What Actually Works

BioMart Pig ↔ Mouse 1:1 Ortholog Mapping for Cross-Species Proteomics (R + Python Tutorial)

Why You Must NOT Merge Species FASTA Databases in Cross-Species Proteomics (Shared Peptide Problem)

Reanalyzing PRIDE PXD023694 — Matrigel Nuclear Contaminants (EWSR1, RUVBL2) You'll Find in Real Data

From DIA-NN Output to Paper Draft: A Complete AI-Assisted Proteomics Workflow (2026)