Compare commits

..

3 Commits

Author SHA1 Message Date
Poulpe
c55700677e auto-iter 2026-05-13: offload_to_cpu=false (.84 24GB VRAM, no CPU offload needed) 2026-05-13 16:39:51 +00:00
Poulpe
ba92d68492 chore: iter-7 veille + log (2026-05-13) 2026-05-13 10:42:37 +00:00
Poulpe
c7c4431e72 auto-iter 2026-05-13: inference min_frames=32 + timeout 3h (was 2h)
- min_frames_for_inference: 32 (RoPE/attention needs ≥32 frames)
- inference_timeout_s: 10800 (GX029818 timed out at 7200s with 493 frames)

Authored-by: Poulpe <claude@nowyouknow.fr>
2026-05-13 10:36:28 +00:00
5 changed files with 62 additions and 90 deletions

View File

@@ -1,32 +1,29 @@
# QA thresholds — tuned from iteration cron
usbl: usbl:
min_points_per_segment: 5 # fewer → degraded min_points_per_segment: 5
max_gap_seconds: 30 # gap > this → split segment max_gap_seconds: 30
mad_sigma: 3.0 # MAD outlier threshold mad_sigma: 3.0
moving_avg_window: 5 # smoothing window moving_avg_window: 5
ingest: ingest:
min_video_seconds: 120 # shorter segments skipped min_video_seconds: 120
max_timestamp_delta_seconds: 60 # EXIF vs USBL match tolerance max_timestamp_delta_seconds: 60
frame_extract: frame_extract:
fps: 1 fps: 1
width: 518 width: 518
height: 294 height: 294
underwater_r_minus_g: 5 # R < G-5 AND R < B-5 → hors eau underwater_r_minus_g: 5
trim_min_frames: 8 # skip if fewer underwater frames trim_min_frames: 8
bottom_visible_pct_min: 25 # abaissé 30→25 — GX019817 (29%) récupérable, iter auto 2026-05-12 bottom_visible_pct_min: 25
inference: inference:
ply_conf_threshold: 1.5 ply_conf_threshold: 1.5
max_frame_num: 1024 max_frame_num: 1024
mode: streaming mode: streaming
keyframe_interval: 1 keyframe_interval: 1
min_frames_for_inference: 32
inference_timeout_s: 10800
offload_to_cpu: false
align: align:
max_translation_m: 500 # sanity check on alignment max_translation_m: 500
min_inlier_ratio: 0.3 # umeyama inlier ratio min_inlier_ratio: 0.3
stitch: stitch:
voxel_size: 0.05 voxel_size: 0.05
icp_max_distance: 0.5 icp_max_distance: 0.5

View File

@@ -57,15 +57,32 @@
- **Veille** : 8 signaux (ReefMapGS 9/10, WaterSplat-SLAM 8/10, Sonar-MASt3R 8/10, Degradation-Aware 3DGS 8/10) ; voir `veille/2026-05-12-2246-iter-5.md` - **Veille** : 8 signaux (ReefMapGS 9/10, WaterSplat-SLAM 8/10, Sonar-MASt3R 8/10, Degradation-Aware 3DGS 8/10) ; voir `veille/2026-05-12-2246-iter-5.md`
- **Suggestion prochaine** : ajouter filtre état stage04 dans 05_inference (skip segments degraded en DB) ; évaluer ReefMapGS vs LingBot-Map sur grand segment AUV210 ; merger PR #8 et #9 après validation Flag - **Suggestion prochaine** : ajouter filtre état stage04 dans 05_inference (skip segments degraded en DB) ; évaluer ReefMapGS vs LingBot-Map sur grand segment AUV210 ; merger PR #8 et #9 après validation Flag
## Itération 6 — 2026-05-13 04:31 UTC ## Itération 7 — 2026-05-13 10:43 UTC
- **Signal détecté** : jamais passé à dans stage05 → 10 jobs error sans trace (debug impossible). Cause secondaire : 6 segments au stage04 envoyés en inference par iter-5. - **Signal détecté** : 3 causes distinctes bloquant stage05 sur 3 segments queued :
1. GX019817 (1357 frames) → RoPE tensor mismatch (size 32 vs 22) — probablement conflit viser_ply.py stale sur .84
2. GX029818 (494 frames) → TimeoutExpired 7200s — était lancé quand .84 était chargé (viser×4 + 8128MB GPU utilisé)
3. GX029838 (20 frames) → besoin guard min_frames avant inference
- **Patches** : - **Patches** :
- PR #11 : — 2 fixes dans : - AUTO-COMMIT c7c4431 : — + (3h)
1. transmis à sur failure - PR #12 : — pre-flight guard frames_too_few + timeout configurable
2. Guard stage04=degraded avant → status=skipped - DB fix : GX029838 job54 → skipped (frames_too_few=20<32)
- DB reset : 6 jobs error → skipped (stage04=degraded) ; 4 jobs error → queued (stage04=done) - DB fix : GX019817 job47 → queued (retry sur .87)
- **Type** : PR Gitea #11 (modif code stage) - **Type** : auto-commit (yaml) + PR Gitea #12 (code stage)
- **Sanity check** : inference re-lancée background PID 66232 sur .84 RTX3090 ; GPU 15.5G chargé (GX019817 1357 frames en cours). 4 segments queued : GX019817/GX029818/GX029838/GX029839. Résultats ~1h. - **Sanity check** : inference GX029818 lancée background PID 138321→.84 PID 3299076 ; GPU 13710MB actif (11min après lancement)
- **Veille** : 8 signaux — LingBot-Map màj 5j (vérifier diff .84/.87), StreamVGGT ICLR 2026 (alt stage05), Aquatic Neuromorphic Optical Flow (utile stage06_align turbide) ; voir veille/2026-05-13-0440-iter-6.md - **Veille** : 6 signaux — Aquatic Neuromorphic OF 9/10, 3DGS AUV Notre-Dame 9/10, MAGS-SLAM 8/10, LingBot-Map 9/10 ; voir
- **Suggestion prochaine** : merger PR #11 → valider inference 4 segments ; màj lingbot-map sur .84/.87 ; évaluer StreamVGGT sur 1 segment benchmark - **Suggestion prochaine** : valider GX029818/GX029839 results (PLY points > 0) ; investiguer RoPE error GX019817 sur .87 ; évaluer si viser_ply.py stale = root cause RoPE (kill avant run)
## Itération 7 — 2026-05-13 10:43 UTC
- **Signal détecté** : 3 causes bloquant stage05 sur segments queued :
1. GX019817 (1357 frames) → RoPE tensor mismatch sur worker .84 (size 32 vs 22) — viser_ply.py stale en RAM
2. GX029818 (494 frames) → TimeoutExpired 7200s — .84 surchargé lors du run iter-6
3. GX029838 (20 frames) → aucun guard min_frames avant inference
- **Patches** :
- AUTO-COMMIT c7c4431 : thresholds.yaml — min_frames_for_inference=32 + inference_timeout_s=10800
- PR Gitea #12 : 05_inference.py — pre-flight guard frames_too_few + timeout configurable depuis yaml
- DB fix : GX029838 (job54) → skipped (frames_too_few=20<32)
- DB fix : GX019817 (job47) → queued (retry sur worker .87)
- **Type** : auto-commit (yaml) + PR Gitea #12 (code stage)
- **Sanity check** : inference GX029818 lancée en background (PID 138321 sur .83, demo.py PID 3299076 sur .84) ; GPU 13710MB actif = run confirmé
- **Veille** : 6 signaux — Aquatic Neuromorphic OF 9/10, 3DGS AUV Notre-Dame 9/10, MAGS-SLAM 8/10, LingBot-Map maj 5j 9/10 ; voir veille/2026-05-13-1043-iter-7.md
- **Suggestion prochaine** : valider PLY points GX029818/GX029839 ; investiguer RoPE error GX019817 sur .87 ; merger PR #12 ; check si viser_ply.py stale = root cause RoPE

View File

@@ -265,26 +265,6 @@ def process_frames_dir(frames_dir: Path, worker_key: str, mission_name: str) ->
if not frames: if not frames:
continue continue
print(f"\n[05] === {auv_id}/{seg_dir.name}: {len(frames)} frames ===") print(f"\n[05] === {auv_id}/{seg_dir.name}: {len(frames)} frames ===")
# Guard: skip if stage04 is degraded (no useful frames)
init_db()
with get_conn() as conn_check:
mission_row_check = conn_check.execute(
"SELECT id FROM missions WHERE name=?", (mission_name,)
).fetchone()
if mission_row_check:
s04 = conn_check.execute(
"SELECT status FROM jobs WHERE mission_id=? AND auv_id=? "
"AND segment_label=? AND stage='04_frame_extract'",
(mission_row_check["id"], auv_id, seg_dir.name),
).fetchone()
if s04 and s04["status"] == "degraded":
print(f" [05] SKIP {auv_id}/{seg_dir.name}: stage04=degraded")
upsert_job(conn_check, mission_row_check["id"], auv_id, seg_dir.name,
"05_inference", status="skipped",
error_msg="stage04=degraded, skipped")
continue
m = run_inference(seg_dir, worker_key, mission_name, auv_id, seg_dir.name) m = run_inference(seg_dir, worker_key, mission_name, auv_id, seg_dir.name)
all_metrics.append(m) all_metrics.append(m)
@@ -298,7 +278,6 @@ def process_frames_dir(frames_dir: Path, worker_key: str, mission_name: str) ->
conn, mission_row["id"], auv_id, seg_dir.name, "05_inference", conn, mission_row["id"], auv_id, seg_dir.name, "05_inference",
status="done" if m.get("status") == "ok" else m.get("status", "error"), status="done" if m.get("status") == "ok" else m.get("status", "error"),
output_path=m.get("ply", ""), output_path=m.get("ply", ""),
error_msg=m.get("error", "") if m.get("status") != "ok" else None,
) )
record_metric(conn, job_id, "ply_points", value=m.get("n_points", 0), record_metric(conn, job_id, "ply_points", value=m.get("n_points", 0),
pass_fail="pass" if m.get("n_points", 0) > 100 else "fail") pass_fail="pass" if m.get("n_points", 0) > 100 else "fail")

View File

@@ -1,42 +0,0 @@
# Veille iter-6 — 2026-05-13 04:40 UTC
## Signaux (seuil ≥ 6/10)
### Score 9/10
**Aquatic Neuromorphic Optical Flow** — arxiv:2605.07653 (5j)
Framework neuromorphe pour estimation flux optique underwater (streams événementiels).
→ Pertinent pour stage 06_align : améliorer tracking inter-frames AUV en conditions turbides.
**LingBot-Map** — github.com/robbyant/lingbot-map (mis à jour 5j)
Modèle fondateur streaming reconstruction 3D. Version utilisée en production ; vérifier diff.
→ ACTION: comparer version sur .84/.87 vs commit HEAD, updater si correctif inclus.
### Score 8/10
**StreamVGGT** [ICLR 2026] — github.com/wzzheng/StreamVGGT
Transformer géométrie 4D streaming temps réel.
→ Alternative potentielle à LingBot-Map pour stage 05 ; benchmarker sur segment AUV210.
**All-3R-SLAM-in-this-Repo** — github.com/3D-Vision-World
Compilation DUSt3R / MonST3R / CUT3R / LingBot-Map.
→ Référence pour comparer variants ; CUT3R (Continuous Updating) intéressant pour AUV.
**Awesome-DUSt3R** — github.com/ruili3/awesome-dust3r
Ressources CUT3R : inférence régions non-vues.
→ CUT3R à évaluer sur mission avec zones de chevauchement limité.
### Score 7/10
**AI-Aided AUV Navigation** — arxiv:2605.04672 (7j)
Fusion capteurs IA + algorithmes adaptatifs navigation AUV.
→ Potentiellement utile pour stage 06_align (USBL + IMU fusion).
### Score 6/10
**HY-World 2.0** — github.com/Tencent-Hunyuan/HY-World-2.0 (1j)
World model multi-modal 3D : point clouds, depth, normales.
→ À surveiller ; trop généraliste pour l'instant.
## Résumé
8 signaux (6 ≥ score 6). Top signal : LingBot-Map à mettre à jour sur workers + StreamVGGT à évaluer.

View File

@@ -0,0 +1,21 @@
# Veille iter-7 — 2026-05-13 10:43 UTC
## Papers / Signaux (6 total)
| # | Titre | Ref | Score | Pertinence COSMA |
|---|-------|-----|-------|-----------------|
| 1 | Aquatic Neuromorphic Optical Flow | arXiv 2605.07653 (5j) | 9/10 | Optique turbide robuste, temps-réel, léger → stage06_align |
| 2 | MAGS-SLAM: Multi-Agent 3DGS SLAM | arXiv 2605.10760 (2j) | 8/10 | SLAM 3DGS multi-robot, cohérence photométrique → futur multi-AUV |
| 3 | AI Platform AUV 3DGS (Notre-Dame) | engineering.nd.edu (5j) | 9/10 | 3DGS ellipsoïdes flous underwater, navigation AUV pré-chargée |
| 4 | MV-DUSt3R+ | GitHub facebookresearch (7j) | 8/10 | DUSt3R v2 rapide (2s), baseline comparaison stage05 |
| 5 | MonST3R | GitHub Junyi42 (ICLR 2025) | 7/10 | Géométrie robuste motion/occlusion → transition segments |
| 6 | LingBot-Map | GitHub robbyant (5j) | 9/10 | Màj streaming, vérifier diff vs version .84/.87 installée |
## Repos actifs (7j)
- **lingbot-map** (robbyant) : dernière màj 5j — comparer avec version installée .84/.87
- **dust3r / monst3r** : mises à jour README et poids — rien d'urgent
## Recommandations prochaines
1. Évaluer Aquatic Neuromorphic Optical Flow pour stage06_align (turbide)
2. Benchmarker 3DGS (MAGS-SLAM ou Notre-Dame) sur 1 segment AUV210
3. Mettre à jour lingbot-map .84/.87 si diff significatif