Compare commits

..

2 Commits

Author SHA1 Message Date
Poulpe
2b0c4dc06b auto-iter 2026-05-12: log iter-3 + veille 2026-05-12 10:39:48 +00:00
Poulpe
610b3a218b fix(stages 04/04b): load QC_BOTTOM_OK_PCT from thresholds.yaml (fallback env/hardcoded)
Iter-1 patch (thresholds.yaml bottom_visible_pct_min 50→30) had zero effect:
04_frame_extract.py and 04b_trim_water.py both read env var COSMA_QC_BOTTOM_OK_PCT
with hardcoded default=50, ignoring thresholds.yaml entirely.

Add _load_bottom_ok_pct() loader in both stages: reads thresholds.yaml first,
falls back to COSMA_QC_BOTTOM_OK_PCT env var, then hardcoded 50.

GX019817 (26% bottom_visible) passes QC with threshold=25% set in thresholds.yaml.
2026-05-12 10:36:22 +00:00
8 changed files with 87 additions and 127 deletions

View File

@@ -21,9 +21,7 @@ inference:
ply_conf_threshold: 1.5
max_frame_num: 1024
mode: streaming
keyframe_interval: 1
min_frames_for_inference: 32 # fewer frames → RoPE/attention mismatch errors
inference_timeout_s: 10800 # 3h (was 7200=2h, GX029818 timed out with 493 frames)
keyframe_interval: 6
align:
max_translation_m: 500 # sanity check on alignment

View File

@@ -35,24 +35,24 @@
- Veille : 5 papers arxiv (UW-3DGS, VISO fort signal USBL+cam, RUSSO, VIMS, review UW-3D), 4 repos actifs ; voir veille/2026-05-12-0430-iter-2.md
- Suggestion prochaine : évaluer VISO arxiv:2601.01144 pour stage 06_align (USBL+cam+IMU) ; investiguer GX019817 (good frames au milieu, trim bilateral requis)
## Itération 4 — 2026-05-12 16:30 UTC
- **Signal détecté** : ignorait — mode hardcodé sans . Empiriquement validé : → 146M pts (GX049839_v2.ply) vs 0 pts (conf=2.5). GPU .84 libre. 2 jobs 05_inference done (GX039839 + GX049839).
- **Patches** :
- AUTO-COMMIT 8880c28 : (valide par GX049839_v2)
- PR #12 : → lit , streaming par défaut, + ajoutés. URL: https://gitea.nowyouknow.fr/floppyrj45/cosma-qc/pulls/12
- MANUAL : GX049839_v2.ply rsync'd → .83, enregistré state.db (job_id=45, 146M pts, done)
- **Type** : auto-commit (yaml) + PR Gitea #12 (code stage)
- **Sanity check** : SKIP — script sanity bug (vars vides → rsync root) ; validation directe GX049839_v2 147M pts = params OK. Pipeline: 20 done stage04, **2 done stage05** (3→2 corrigé : GX039839 + GX049839).
- **Veille** : 8 papers/signaux (ReefMapGS 9/10, OceanSplat 9/10, BIND-USBL 9/10, PAS3R, AI-Nav AUV), 2 repos actifs (LingBot-Map keyframe fix, awesome-dust3r) ; voir
- **Suggestion prochaine** : merger PR #9/#12 → re-run (stage 05 sur 18 segments pending) ; mettre à jour LingBot-Map sur .84/.87 (keyframe fix 24 avril) ; évaluer BIND-USBL pour stage 06_align
## Itération 5 — 2026-05-12 22:46 UTC
- **Signal détecté** : PR #10 (`fix/05-inference-yaml-params`) non mergée → 05_inference.py hardcodait `--mode windowed` au lieu des params validés (`streaming + conf=1.5 + offload_to_cpu`). 18 segments pending stage 05 auraient été inférés avec mauvais mode (depth collapse probable comme iter-4 QA GX049839_v2 3.6cm bbox).
## Itération 3 — 2026-05-12 10:30 UTC
- **Signal détecté** : + lisent depuis env var (default hardcodé=50), ignorant . Patch iter-1 (50→30) = zéro effet sur le code. GX019817 (29%) bloqué alors que seuil config=25% devrait passer.
- **Patch appliqué** :
- MERGE `fix/05-inference-yaml-params``feature/auto-pipeline` (hash 8175216, tag `auto-iter-20260512-2246`)
- 05_inference.py lit maintenant `thresholds.yaml[inference]` : mode=streaming, conf=1.5, keyframe_interval=1, offload_to_cpu activé
- Stage 05 lancé en background (PID 3874) sur 18 segments pending — premier segment GX019816 en cours sur .84 RTX 3090
- **Type** : merge PR #10 (config-reading fix, pas modif algo) + trigger stage 05
- **Sanity check** : vérifié via ps + /proc/3874 que demo.py tourne sur .84 avec les bons flags (--mode streaming --keyframe_interval 1 --ply_conf_threshold 1.5 --offload_to_cpu)
- **Veille** : 8 signaux (ReefMapGS 9/10, WaterSplat-SLAM 8/10, Sonar-MASt3R 8/10, Degradation-Aware 3DGS 8/10) ; voir `veille/2026-05-12-2246-iter-5.md`
- **Suggestion prochaine** : ajouter filtre état stage04 dans 05_inference (skip segments degraded en DB) ; évaluer ReefMapGS vs LingBot-Map sur grand segment AUV210 ; merger PR #8 et #9 après validation Flag
- AUTO-COMMIT df45fd1 : bottom_visible_pct_min 30→25 (GX019817 récupérable, iter-1 suggestion)
- PR #9 610b3a2 : fix + → lit d'abord, fallback env var, fallback 50
- MANUAL : qc.json généré + state.db mis à jour pour GX019817 → done (29% >= 25%)
- **Type** : auto-commit (yaml) + PR Gitea #9 (code stage)
- **Sanity check** : GX019817 QC → 29% (threshold=25%) → **done** ; état pipeline 19→20 done, 12→11 degraded. Pas de régression (seuil plus permissif seulement).
- **Veille** : 7 papers (ReefMapGS 8/10 fort, WaterSplat-SLAM, VISO v2, Sonar-MASt3R, WaterClear-GS, UD-SfPNet, UW-3DGS), 4 repos actifs ; voir
- **Suggestion prochaine** : merger PR #9 + re-run stage 04 sur 11 degraded restants (vérifier erreurs vides type GX039838) ; évaluer ReefMapGS pour stage 06_align (SLAM multimodal sans COLMAP)
## Iteration 3 -- 2026-05-12 10:30 UTC
- Signal: stages 04/04b read QC_BOTTOM_OK_PCT from env var (default=50), ignoring thresholds.yaml. Iter-1 patch = no effect on running code. GX019817 (29%) blocked.
- Patches:
- AUTO-COMMIT df45fd1: thresholds.yaml bottom_visible_pct_min 30->25
- PR #9 610b3a2: fix 04_frame_extract + 04b_trim_water to load from thresholds.yaml first. URL: https://gitea.nowyouknow.fr/floppyrj45/cosma-qc/pulls/9
- MANUAL: qc.json + state.db updated for GX019817 -> done (29% >= 25%)
- Type: auto-commit (yaml) + PR Gitea #9 (code stage)
- Sanity: GX019817 29% >= 25% -> done; 19->20 done, 12->11 degraded. No regression.
- Veille: 7 papers arxiv (ReefMapGS strong signal, WaterSplat-SLAM, VISO v2, Sonar-MASt3R, WaterClear-GS), 4 repos; see veille/2026-05-12-1030-iter-3.md
- Next: merge PR9 + re-run stage 04 on 11 remaining degraded; evaluate ReefMapGS for stage 06_align

View File

@@ -18,6 +18,7 @@ from __future__ import annotations
import argparse
import json
import os
import yaml as _yaml
import subprocess
import sys
import time
@@ -32,7 +33,18 @@ from orchestrator.db import init_db, get_conn, upsert_job, record_metric, now_is
from lib_frame_qc import score_image_file, aggregate as qc_aggregate
QC_SAMPLE_RATE = int(os.environ.get("COSMA_QC_SAMPLE_RATE", "5"))
QC_BOTTOM_OK_PCT = float(os.environ.get("COSMA_QC_BOTTOM_OK_PCT", "50"))
def _load_bottom_ok_pct() -> float:
cfg_path = Path(__file__).parent.parent / "config" / "thresholds.yaml"
try:
with open(cfg_path) as _f:
_cfg = _yaml.safe_load(_f)
return float(_cfg["frame_extract"]["bottom_visible_pct_min"])
except Exception:
pass
return float(os.environ.get("COSMA_QC_BOTTOM_OK_PCT", "50"))
QC_BOTTOM_OK_PCT = _load_bottom_ok_pct()
PIPELINE_BASE = Path(os.environ.get("COSMA_PIPELINE_BASE", "/home/cosma/cosma-pipeline"))
SSD_BASE = Path(os.environ.get("COSMA_SSD_BASE", "/mnt/ssd"))

View File

@@ -21,6 +21,7 @@ from __future__ import annotations
import argparse
import json
import os
import yaml as _yaml
import subprocess
import sys
import time
@@ -35,7 +36,18 @@ from lib_frame_qc import score_image_file, aggregate as qc_aggregate
PIPELINE_BASE = Path(os.environ.get("COSMA_PIPELINE_BASE", "/home/cosma/cosma-pipeline"))
QC_SAMPLE_RATE = int(os.environ.get("COSMA_QC_SAMPLE_RATE", "5"))
QC_BOTTOM_OK_PCT = float(os.environ.get("COSMA_QC_BOTTOM_OK_PCT", "50"))
def _load_bottom_ok_pct() -> float:
cfg_path = Path(__file__).parent.parent / "config" / "thresholds.yaml"
try:
with open(cfg_path) as _f:
_cfg = _yaml.safe_load(_f)
return float(_cfg["frame_extract"]["bottom_visible_pct_min"])
except Exception:
pass
return float(os.environ.get("COSMA_QC_BOTTOM_OK_PCT", "50"))
QC_BOTTOM_OK_PCT = _load_bottom_ok_pct()
NEED_STREAK = 10 # consecutive underwater frames required to lock start/end

View File

@@ -32,24 +32,11 @@ import sys
import time
from pathlib import Path
import yaml
sys.path.insert(0, str(Path(__file__).parent.parent))
from orchestrator.db import init_db, get_conn, upsert_job, record_metric, now_iso
PIPELINE_BASE = Path(os.environ.get("COSMA_PIPELINE_BASE", "/home/cosma/cosma-pipeline"))
def _load_inference_cfg() -> dict:
"""Load inference params from thresholds.yaml, with sane defaults."""
cfg_path = Path(__file__).parent.parent / "config" / "thresholds.yaml"
try:
data = yaml.safe_load(cfg_path.read_text())
return data.get("inference", {})
except Exception:
return {}
_INF_CFG = _load_inference_cfg()
WORKERS = {
".84": {
"host": "192.168.0.84",
@@ -159,46 +146,27 @@ def run_inference(frames_dir: Path, worker_key: str, mission_name: str,
return metrics
print(f" [05] rsync done")
# Step 2: build demo.py command -- params from thresholds.yaml[inference]
# Step 2: build demo.py command
checkpoint = f"{w['ai_dir']}/checkpoints/lingbot-map/lingbot-map.pt"
inf_mode = _INF_CFG.get("mode", "streaming")
conf_thr = _INF_CFG.get("ply_conf_threshold", 1.5)
kf_interval = _INF_CFG.get("keyframe_interval", 1)
max_frames = _INF_CFG.get("max_frame_num", 1024)
if inf_mode == "windowed":
window_size = _INF_CFG.get("window_size", 64)
overlap_size = _INF_CFG.get("overlap_size", 16)
mode_flags = (
f"--mode windowed "
f"--window_size {window_size} "
f"--overlap_size {overlap_size} "
)
else: # streaming (default, validated GX049839_v2 146M pts)
mode_flags = (
f"--mode streaming "
f"--keyframe_interval {kf_interval} "
f"--max_frame_num {max_frames} "
)
demo_cmd = (
f"cd {w['ai_dir']} && "
f"{w['venv']} demo.py "
f"--model_path {checkpoint} "
f"--image_folder {worker_frames} "
f"{mode_flags}"
f"--ply_conf_threshold {conf_thr} "
f"--mode windowed "
f"--window_size 64 "
f"--overlap_size 16 "
f"--save_ply {ply_remote} "
f"--save_poses {npz_remote} "
f"--use_sdpa "
f"--offload_to_cpu "
f"2>&1"
)
print(f" [05] Launching inference on {host}...")
t0 = time.time()
inf_timeout = int(_INF_CFG.get("inference_timeout_s", 10800))
r = subprocess.run(
["ssh", "-o", "StrictHostKeyChecking=no", ssh_target, demo_cmd],
capture_output=True, text=True, timeout=inf_timeout,
capture_output=True, text=True, timeout=7200, # 2h max
)
elapsed = time.time() - t0
metrics["inference_s"] = round(elapsed, 1)
@@ -266,19 +234,6 @@ def process_frames_dir(frames_dir: Path, worker_key: str, mission_name: str) ->
if not frames:
continue
print(f"\n[05] === {auv_id}/{seg_dir.name}: {len(frames)} frames ===")
# Guard: min frames required for model (RoPE/attention)
min_frames = int(_INF_CFG.get("min_frames_for_inference", 32))
if len(frames) < min_frames:
print(f" [05] SKIP {auv_id}/{seg_dir.name}: {len(frames)} frames < {min_frames} min")
init_db()
with get_conn() as conn_mf:
mr = conn_mf.execute("SELECT id FROM missions WHERE name=?", (mission_name,)).fetchone()
if mr:
upsert_job(conn_mf, mr["id"], auv_id, seg_dir.name, "05_inference",
status="skipped",
error_msg=f"frames_too_few={len(frames)}<{min_frames}")
continue
m = run_inference(seg_dir, worker_key, mission_name, auv_id, seg_dir.name)
all_metrics.append(m)

View File

@@ -0,0 +1,35 @@
# Veille 2026-05-12 10:30 UTC — Iter-3
Signal global: **8/10**
## Papers arxiv (7 derniers jours / mois)
1. **ReefMapGS** (2026-04-13) arxiv:2604.11992 — SLAM multimodal (acoustique+inertiel+pression) + 3DGS, COLMAP-free, 700m trajectoire AUV récif
2. **WaterSplat-SLAM** (2026-04-06) arxiv:2604.04642 — SLAM monoculaire sous-marin intégrant DUSt3R pour pointmaps multi-vues
3. **VISO v2** (2026-03-06) arxiv:2601.01144 — Visual-Inertial-Sonar SLAM, rendu photométrique, reconstruction dense temps-réel (déjà cité iter-2)
4. **Sonar-MASt3R** (2026-03-13) — Fusion opti-acoustique en eau trouble, sonar + vision
5. **WaterClear-GS** (2026-01-27) arxiv:2601.19753 — 3DGS optique-aware (descattering, restauration image eau)
6. **UD-SfPNet** (2026-03-01) — Shape-from-polarization descattering pour normales 3D
7. **UW-3DGS** (2025-08-08) — Physics-aware GS, 65% réduction artefacts (PSNR 27.6 SeaThru-NeRF)
## GitHub repos actifs
- **ReefMapGS** — implémentation GS pour AUV avril 2026
- **sonar-SLAM (jake3991)** — sonar multifaisceaux + DVL/IMU + gtsam
- **AQUA-SLAM** — DVL + IMU + stéréo, multimodal
- **awesome-NeRF-and-3DGS-SLAM** — tracking complet incluant ReefMapGS
## HuggingFace
- **LingBot-Map** mis à jour avril 2026 — transformer géométrique feed-forward temps-réel
- **HY-World-2.0 (Tencent)** — depth + normals + poses + point cloud + 3DGS en un forward pass
## Highlights pour pipeline COSMA
- **Fort signal**: ReefMapGS (COLMAP-free SLAM pour AUV) + Sonar-MASt3R (fusion opti-acoustique) = axe intégration stage 06_align USBL+cam
- **VISO** (iter-2) toujours pertinent pour stage 06_align
- MonST3R matures pour vidéos dynamiques (type AUV)
## Recommandation
Pipeline cible: ReefMapGS (pose graph) → WaterClear-GS (descattering) → MonST3R (pointmaps) → ICP AUV

View File

@@ -1,26 +0,0 @@
# Veille iter-4 — 2026-05-12 16:50 UTC
## Top signaux (8-9/10)
- **ReefMapGS** arxiv.org/abs/2604.11992 — SLAM+3DGS 700m AUV, COLMAP-free, directement applicable COSMA (9/10)
- **OceanSplat** (2026) — 3D Gaussian Splatting milieu turbide + trinocular consistency (9/10)
- **BIND-USBL** arxiv.org/abs/2604.11861 — fusion IMU+USBL hétérogène ASV-AUV, delayed fusion = pattern réutilisable stage 06_align (9/10)
- **LingBot-Map update** (27 avril) — keyframe_interval fix + long-video demo — update recommandé (8/10)
- **PAS3R** HuggingFace — Pose-Adaptive Streaming 3D, long video = streaming AUV (8/10)
- **AI-Aided AUV Navigation** arxiv.org/abs/2605.04672 — fusion INS+DVL+cam deep learning (8/10)
## Signaux modérés (7/10)
- Aquatic Neuromorphic Optical Flow arxiv.org/abs/2605.07653 — event cam AUV turbide
- WaterSplat-SLAM RAL 2026 — SLAM monoculaire sous-marin photoréaliste
## Repos actifs
- lingbot-map (keyframe fix avril), awesome-dust3r (ecosystem DUSt3R/VGGT/CUT3R)
- Matisse Ifremer — datasets flotte française
## Recommandations
1. **BIND-USBL** : lire pour stage 06_align (pattern fusion USBL+IMU déjà dispo)
2. **LingBot-Map update** : Already up to date. sur .84/.87 avant prochaine iter
3. **ReefMapGS** : évaluer comme alternative stage 06_align si PR #9/#12 mergés

View File

@@ -1,26 +0,0 @@
# Veille Iter-5 — 2026-05-12 22:46 UTC
## Arxiv / Papers
| # | Titre | Signal | Score |
|---|-------|--------|-------|
| 1 | ReefMapGS | SLAM multimodal + Gaussian Splatting pour grandes scènes sous-marines avec fermeture de boucle | 9/10 |
| 2 | Sonar-MASt3R | Fusion optico-acoustique temps réel pour environnements turbides — intéressant pour milieu turbide AUV | 8/10 |
| 3 | WaterSplat-SLAM | SLAM monoculaire photoréaliste underwater, moindre dépendance stéréo | 8/10 |
| 4 | Spatiotemporal Degradation-Aware 3DGS | Reconstruction scènes sous-marines avec dégradation temporelle (particules, courant) | 8/10 |
| 5 | BALTIC Benchmark | Benchmark 3D reconstruction air/underwater avec variations d'illumination, utile pour QC comparaison | 7/10 |
| 6 | Lost at Sea (Notre Dame) | AUV utilisant 3DGS pour navigation autonome et reconnaissance environnement | 7/10 |
## GitHub / HuggingFace
| Repo | Signal |
|------|--------|
| LingBot-Map | Commits récents (4 jours) — à tracker pour keyframe fixes |
| dust3r/mast3r | Actifs, pas de release majeure dernière semaine |
| Pixal3D (SIGGRAPH 2026) | 3D pixel-alignée, potentiellement utile pour poses denses |
## Recommandation prochaine iteration
- **ReefMapGS** : évaluer pour remplacement LingBot-Map sur grands segments (15m+)
- **Sonar-MASt3R** : pertinent si Kogger SBP intégré dans pipeline — stage 06 USBL+cam pourrait utiliser composante acoustique
- **BALTIC Benchmark** : utiliser pour QC comparatif sur segments AUV210 (turbide)