SBM Lab
시스템 생물학 & 바이오인포매틱스 블로그. 프로테오믹스, 바이오마커, 건강과학에 대한 연구 기반 콘텐츠를 제공합니다.
벤 다이어그램 생성기
2~4개 세트를 비교하고 PNG/SVG로 다운로드하세요. 유전자 비교, DEG overlap, 데이터 분석에 활용 가능합니다.
바로 사용하기최신 글
Can an LLM Run an RNA-seq Analysis on Its Own? Building ARIA, a Decision-Aware Transcriptome Framework
A companion to our preprint: reproducible pipelines run RNA-seq steps reliably, but the decisions between steps still need an expert. ARIA puts an LLM in that reasoning seat across 8 decision points — and on 4 public datasets it recovered paired designs, technical covariates, and known biology, with cross-method agreement r > 0.99.
A MOGONET-Style Multi-Omics Biomarker Pipeline: Why a Near-Random Graph Net Still Earns Its Place
Honest engineering write-up of a MOGONET-style multi-omics consensus biomarker pipeline. On a small synthetic cohort the graph network scores near-random in leak-free cross-validation (AUC 0.53) — yet the 5-evidence consensus puts known markers in 9 of its top 10. Here is the architecture, the real results, and why a weak single model is still a useful voter.
FragPipe vs MaxQuant — 2026 Real Speed Benchmark (Same Data, Same Output Quality)
Detailed 2026 speed comparison: FragPipe (MSFragger engine) vs MaxQuant on identical DDA datasets, identical FASTA, identical FDR settings. Where FragPipe's 10-50× speedup comes from (fragment ion indexing), where it matches MaxQuant in protein ID counts, and the specific cases where MaxQuant is still better.
더 보기
limma vs DEqMS for Proteomics — When to Use Which (n=3 to n=20+ Comparison)
Both limma and DEqMS provide moderated t-statistics for proteomics differential expression. limma was built for microarrays; DEqMS adds a proteomics-specific variance prior tied to peptide count. This guide compares them on small-n (n=3) and larger (n=10, n=20) proteomics designs with real recommendations on when DEqMS's extra step is worth it.
Imputing Missing Values in Proteomics — knn vs minDet vs MNAR — What Actually Works
Proteomics datasets are full of NAs, and how you handle them can flip your DEP list. This guide compares the four real-world options — leave NAs alone, k-Nearest Neighbors (knn), minimum detected (minDet), and explicit MNAR (Missing Not At Random) imputation — with practical recommendations for DIA-NN and MaxQuant outputs.
BioMart Pig ↔ Mouse 1:1 Ortholog Mapping for Cross-Species Proteomics (R + Python Tutorial)
Step-by-step tutorial: download the Ensembl BioMart Release 111 Sus scrofa ↔ Mus musculus 1:1 ortholog table and join it to your per-species protein quantification in R (biomaRt) and Python (pandas). Why gene-symbol matching is fragile, how to handle one-to-many and many-to-many orthologs, and how to validate your mapping against UniProt.
Why You Must NOT Merge Species FASTA Databases in Cross-Species Proteomics (Shared Peptide Problem)
In cross-species LC-MS/MS proteomics (Porcine ECM vs Mouse Matrigel, human vs mouse cells, host-pathogen, etc.) merging species FASTA into one search database breaks protein quantification. This is the shared-peptide ambiguity problem. Here's why it happens, how to set up species-separated searches in MaxQuant and DIA-NN, and how to handle peptides that genuinely match both species.
Reanalyzing PRIDE PXD023694 — Matrigel Nuclear Contaminants (EWSR1, RUVBL2) You'll Find in Real Data
A practical reanalysis of PRIDE dataset PXD023694 (Cho lab cross-species ECM) — what proteins actually come out of mouse Matrigel vs porcine ECM, the basement membrane signature (LAMB1/NID1/LAMC1/HSPG2) that replicates Park et al. 2026, and the contamination patterns (Matrigel nuclear proteins EWSR1/RUVBL2, porcine smooth muscle remnants TPM4/MYL9) that simulations miss but real data shows.
From DIA-NN Output to Paper Draft: A Complete AI-Assisted Proteomics Workflow (2026)
An honest, end-to-end guide to using AI across the full DIA-NN proteomics pipeline — from report.tsv quantification and downstream statistics through figure interpretation, Discussion writing, and a manuscript draft. Includes real prompts, where AI fails (with caught examples), verification checklists, and 2026 journal disclosure requirements.