Data science/ML/AI

Statistical Moments (M1, M2) for Data Analysis

Here are 5 curated PDFs diving into the mean (M1), variance (M2), and their applications in crafting research questions and sourcing data.

A channel member requested resources on this topic and we delivered.

If you have a topic you want resources on let us know, and we’ll make it happen!

@datascience_bds

moment.pdf

93.7 KB

Experimental-Design_Statistical-Analysis-of-Data.pdf

1.6 MB

Method of moments.pdf

1.3 MB

Lec6_Methods of moment estimator.pdf

250.5 KB

❤8

871 views09:05

Data science/ML/AI

Excel Vs SQL Vs Python

❤6👍3

720 views09:10

Data science/ML/AI

Basic SQL Commands

❤2

612 views07:35

Data science/ML/AI

📚 Data Science Riddle

Why do we use Batch Normalization?

Anonymous Quiz

❤3

90 voters604 views11:15

Data science/ML/AI

LLM Cheatsheet

❤5

515 views06:55

Data science/ML/AI

📚 Data Science Riddle

Your object detection model misses small objects. Easiest fix?

Anonymous Quiz

24%

Use larger input images

67 voters484 views09:20

Data science/ML/AI

🤖 AI that creates AI: ASI-ARCH finds 106 new SOTA architectures

ASI-ARCH — experimental ASI that autonomously researches and designs neural nets. It hypothesizes, codes, trains & tests models.

💡 Scale:
1,773 experiments → 20,000+ GPU-hours.
Stage 1 (20M params, 1B tokens): 1,350 candidates beat DeltaNet.
Stage 2 (340M params): 400 models → 106 SOTA winners.
Top 5 trained on 15B tokens vs Mamba2 & Gated DeltaNet.

📊 Results:
PathGateFusionNet: 48.51 avg (Mamba2: 47.84, Gated DeltaNet: 47.32).
BoolQ: 60.58 vs 60.12 (Gated DeltaNet).
Consistent gains across tasks.
🔍 Insights:
Prefers proven tools (gating, convs), refines them iteratively.
Ideas come from: 51.7% literature, 38.2% self-analysis, 10.1% originality.
SOTA share: self-analysis ↑ to 44.8%, literature ↓ to 48.6%.

@datascience_bds

❤4

384 views08:57

Data science/ML/AI

🚀 Databricks Tip: REPLACE vs MERGE

When updating Delta tables, you’ve got two powerful options:

🔹 REPLACE TABLE … ON
📚 Like throwing away the entire library and rebuilding it.
- Drops the old table & recreates it.
- Schema + data = fully replaced.
- ⚡ Super fast but destructive (old data gone).
- ✅ Best for full refreshes or schema changes.

🔹 MERGE
📖 Like updating only the books that changed.
- Works row by row.
- Updates, inserts, or deletes specific records.
- 🔍 Preserves unchanged data.
- ✅ Best for incremental updates or CDC (Change Data Capture).

⚖️ Key Difference
- REPLACE = Start fresh with a new table.
- MERGE = Surgically update rows without losing the rest.

👉 Rule of thumb:
Use REPLACE for full rebuilds,
Use MERGE for incremental upserts.

#Databricks #DeltaLake

❤3

154 viewsedited 08:48

2025/10/22 13:01:13
Back to Top

HTML Embed Code:

<iframe width="100%" src="https://www.bootg.com/buyppe/web?embed=1" title="Telegram Web" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>