| 2026-03-14 |
monitoring: auto-discover SMART devices instead of hardcoding
...
Remove smartDevices option and per-host device lists. Telegraf will
now scan all block devices automatically, so disks can be added or
removed without config changes.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
skydick: increase NFS server threads to 64
...
Default 8 threads is insufficient for 10GbE throughput.
64 threads allow better parallelism for concurrent NFS clients.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
common: disable global flake registry fetch
...
channels.nixos.org is unreachable from CN, causing 25s of
retries on every nix-shell/nix run invocation.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
common: add TUNA mirror as primary Nix substituter, add btop
...
cache.nixos.org has ~1.1s latency from CN. TUNA mirror responds
in ~29ms (38x faster). Set connect-timeout=5 and
stalled-download-timeout=15 to fail fast on unreachable mirrors.
Also add btop to skydick monitoring packages.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
monitoring: add ZFS pool health exec input
...
Custom script reports zpool health as numeric metric (0=ONLINE,
1=DEGRADED, 2=FAULTED, etc.) via Telegraf inputs.exec, enabling
Grafana alerting on pool degradation.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
skydick: use all_squash for media/torrent NFS exports
...
Map all NFS client UIDs to qbittorrent:storage (900:997) on
media and torrent exports. Eliminates need for UID/GID
coordination between NFS clients and server.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
skydick: fix qbittorrent UID collision with ylw
...
UID 1002 was already assigned to ylw on skydick. Change qbittorrent
system user to UID 900 to avoid the collision. NFS sec=sys maps by
UID number, so this must not conflict with any normal user.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
monitoring: fix InfluxDB URL and add nvme-cli to Telegraf PATH
...
Use door1's LAN IP (10.0.91.30) instead of WireGuard IP (172.16.1.1)
for InfluxDB endpoint. Add nvme-cli to Telegraf's PATH for NVMe SMART
attribute collection.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
skydick: add qbittorrent user and make media NFS export writable
...
Add qbittorrent system user (UID 1002, group storage) for NFS
root_squash write access. Change /srv/media export from ro,async
to rw,sync to support *arr torrent downloads under /srv/media/torrents/.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
monitoring: add lm_sensors and smartmontools to Telegraf PATH
...
Telegraf inputs.sensors needs the `sensors` binary in PATH.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
skydick: add Telegraf monitoring with SMART, ZFS, and system metrics
...
Sends metrics to door1 InfluxDB (bucket: skydick) via Telegraf.
Monitors all 5 Mach2 SAS drives, NVMe P4500, and boot SSD via SMART.
InfluxDB token encrypted with agenix.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
skydick: add NVMe SLOG+L2ARC documentation for datapool
...
Intel DC P4600 750GB: 8GB SLOG partition for sync write acceleration,
remaining ~690GB as L2ARC read cache for VM working sets.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
| 2026-03-13 |
skydick: fill in actual Mach2 WWNs for datapool creation
...
5x ST14000NM0001 14TB SAS Mach2 drives, 4 active + 1 spare.
Mirror vdevs pair LUN0/LUN1 from different physical drives.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
skydick: extract datapool.nix for Mach2 ZFS storage config
...
Move all storage-serving config (NFS, Samba, iSCSI, tmpfiles, firewall
ports, storage group) from default.nix into datapool.nix. Add Mach2
dual-actuator mirror layout documentation, new datasets (torrent, vm),
and clean permission model (setgid storage group for user data).
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
| 2026-03-11 |
users: equalize ldx and ylw permissions
...
- Add ylw to NOPASSWD sudo rule (matching ldx for deploy-rs)
- Add ldx hashedPassword on xlab-gateway (matching ylw)
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
users: unify ylw as common admin, keep host-specific passwords and groups
...
Move ylw base identity (isNormalUser, wheel, SSH key) to modules/users.nix
alongside ldx. Host configs retain only extra groups and hashedPassword.
Also renames ye-lw21 to ylw on skydick.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
nix: add ldx as trusted-user for deploy-rs unsigned store paths
...
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
xlab-gateway: fix ylw user missing isNormalUser and add wheel group
...
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
add ylw user to xlab-gateway
|
| 2026-03-09 |
xlab-gateway: add ULA route for internal IPv6 reachability
...
Route fd99:23eb:1682::/48 via wg-to-wgnet in main table so
xlab-gateway and LAN clients can reach 10.0.0.1's IPv6
(fd99:23eb:1682::1). Without this, ULA traffic went through
the WAN default route and got dropped.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
users: grant ldx full NOPASSWD sudo for deploy-rs
...
deploy-rs runs activate-rs, nix-env, switch-to-configuration, and
confirmation commands through separate non-interactive SSH sessions.
Per-command NOPASSWD rules cannot cover all paths it uses. Full
NOPASSWD is the intended deploy-rs setup.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
skydick: disable IPv6 to avoid wg-outbound ULA masquerade timeout
...
IPv6 traffic from skydick goes through freedom routing on 10.0.0.1,
which masquerades with ULA fd99:23eb:1682:fe::2 (not globally routable).
This causes nix/cargo downloads to timeout on AAAA records. Skydick has
no IPv6 use cases as a LAN storage server.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
refactor xlab-gateway IPv6: /64 SLAAC, scoped policy routing, no NAT66
...
- Change LAN prefix from fd99:23eb:1682:fd:df::/80 to fd99:23eb:1682:1::/64
- Switch from DHCPv6 stateful to SLAAC (AdvAutonomous on, AdvManagedFlag off)
- Add RDNSS in radvd pointing to fd99:23eb:1682::1
- Scope policy routing rules to LAN sources only (fixes gateway IPv6 breakage)
- Remove NAT66 masquerade on wg-to-wgnet (10.0.0.1 routes fd99:23eb:1682:1::/64 natively)
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
add systemctl and reboot to NOPASSWD sudo rules
...
Needed for restarting services (systemd-networkd, nftables) after
deploy when switch-to-configuration doesn't detect unit changes.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
| 2026-03-08 |
fix nftables syntax: use meta nfproto ipv6 in inet table
...
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
xlab-gateway: route client IPv6 through wg-to-wgnet
...
- Re-add IPv6 default route (::/0) via wg-to-wgnet in table 1002
- Add NAT66 masquerade on wg-to-wgnet for client ULA→tunnel translation
- Campus WAN has no IPv6 transit; wgnet provides it
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
| 2026-03-07 |
Add switch-to-configuration to NOPASSWD sudo rules
...
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
Add skydick SSH key, set xlab-gateway deploy to LAN IP
...
- Authorize ldx@skydick ed25519 key for cross-machine deploy-rs
- Change xlab-gateway deploy hostname to 10.253.254.1 (LAN, reachable
from skydick)
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
deploy-rs: update xlab-gateway hostname, add NOPASSWD sudo
...
- Change xlab-gateway deploy hostname to WAN IP (166.111.98.29)
- Add NOPASSWD sudo rules for deploy-rs activation commands
(nix-env, activate scripts)
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
xlab-gateway: fix IPv6 routing, clean up table names
...
- Remove IPv6 default route from freedom table (1002) — wgnet peer
doesn't forward IPv6, causing TLS resets on outbound connections
- Remove Tsinghua IPv6 throw route (unnecessary without IPv6 default)
- IPv6 now uses native WAN path instead of WireGuard tunnel
- Rename table 1002 from typo 'wg-to-skywme' to 'freedom-wgnet'
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|