ABran-WMF (arnaudb)
User

User Details

User Since
Aug 29 2023, 8:30 AM (47 w, 2 d)
Availability
Busy until Aug 12.
LDAP User
Arnaudb
MediaWiki User
ABran-WMF [ Global Accounts ]

Recent Activity

Fri, Jul 19

ABran-WMF updated the task description for T367781: Drop deprecated abuse filter fields on wmf wikis.
Fri, Jul 19, 12:52 PM · Schema-change-in-production, Data-Engineering, DBA
ABran-WMF updated subscribers of T365998: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 -lsw1-f3-eqiad .

@Marostegui and I will be absent on Tuesday; hosts have been depooled and are ready.

Fri, Jul 19, 12:25 PM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ABran-WMF claimed T370394: Drop gb_by from globalblocks table.
Fri, Jul 19, 7:04 AM · Data-Engineering, Data Products, Schema-change-in-production, DBA

Thu, Jul 18

ABran-WMF added a comment to T365998: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 -lsw1-f3-eqiad .

data-persistence hosts handled, ready whenever you are @cmooney

Thu, Jul 18, 2:49 PM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ABran-WMF closed T367496: MySQL_legacy Spicerack - fixes as Resolved.

Since https://github.com/wikimedia/operations-software-spicerack/blob/v8.8.0/CHANGELOG.rst has been tested, this can be considered done. The only remaining item is improving how exceptions are carried when a SQL command fails on a host, which is tracked in T370419.

Thu, Jul 18, 2:38 PM · DBA
ABran-WMF triaged T370419: Improve Exceptions on command failure as Low priority.
Thu, Jul 18, 2:37 PM · DBA
ABran-WMF created T370419: Improve Exceptions on command failure.
Thu, Jul 18, 2:36 PM · DBA
ABran-WMF changed the status of T370304: Exception caught inside exception handler: Wikimedia\Rdbms\DBUnexpectedError: Database servers in extension1 are overloaded. from Open to In Progress.
Thu, Jul 18, 6:15 AM · Content-Transform-Team-WIP, MW-1.43-notes (1.43.0-wmf.15; 2024-07-23), User-notice, Wikimedia-Incident, DBA, Wikimedia-production-error
ABran-WMF changed the status of T370265: Create new translate_cache table on Wikimedia wikis with the Translate extension installed from Open to In Progress.
Thu, Jul 18, 6:15 AM · Data-Persistence, LPL Essential (LPL Essential 2024 Jul-Sep), MediaWiki-extensions-Translate

Wed, Jul 17

ABran-WMF changed the status of T367279: Migrate mysql icinga alerts to alert manager - seconds_behind_master + threads (replication/io), a subtask of T315866: Migrate mysql icinga alerts to alert manager, from Open to In Progress.
Wed, Jul 17, 3:34 PM · Patch-For-Review, DBA
ABran-WMF changed the status of T367279: Migrate mysql icinga alerts to alert manager - seconds_behind_master + threads (replication/io) from Open to In Progress.
Wed, Jul 17, 3:34 PM · Patch-For-Review, DBA
ABran-WMF updated the task description for T367279: Migrate mysql icinga alerts to alert manager - seconds_behind_master + threads (replication/io).
Wed, Jul 17, 3:34 PM · Patch-For-Review, DBA
ABran-WMF added a comment to T369855: db1179 crashed - hardware issues.

sure thing!

Wed, Jul 17, 2:54 PM · SRE, DC-Ops, ops-eqiad, DBA
ABran-WMF closed T367280: Migrate mysql icinga alerts to alert manager - memory pressure as Declined.

Could be considered redundant with T367283, which would offer a more specific angle.

Wed, Jul 17, 2:13 PM · Patch-For-Review, DBA
ABran-WMF closed T367280: Migrate mysql icinga alerts to alert manager - memory pressure, a subtask of T315866: Migrate mysql icinga alerts to alert manager, as Declined.
Wed, Jul 17, 2:13 PM · Patch-For-Review, DBA
ABran-WMF added a comment to T370029: cumin2002 db-switchover debug.

done!

Wed, Jul 17, 1:07 PM · DBA
ABran-WMF added a comment to T370029: cumin2002 db-switchover debug.

I've updated

  • dbstore1009
  • db1164 (m1)
  • db1176 (m5)
Wed, Jul 17, 1:04 PM · DBA
ABran-WMF added a comment to T370029: cumin2002 db-switchover debug.

The issue is due to the fact that cumin2002 has python3-wmfmariadbpy at version 0.10 while cumin1002 and most of the hosts with that package have version 0.11.2.
[...] how do you manage the versioning and upgrade of this package?
Full list of hosts with the old version available in Debmonitor: https://debmonitor.wikimedia.org/packages/python3-wmfmariadbpy

Wed, Jul 17, 1:01 PM · DBA
ABran-WMF updated the task description for T367781: Drop deprecated abuse filter fields on wmf wikis.
Wed, Jul 17, 10:00 AM · Schema-change-in-production, Data-Engineering, DBA
ABran-WMF updated subscribers of T369855: db1179 crashed - hardware issues.

This server has been down for a few days, @wiki_willy please let me know if I can help

Wed, Jul 17, 6:41 AM · SRE, DC-Ops, ops-eqiad, DBA
ABran-WMF added a comment to T367781: Drop deprecated abuse filter fields on wmf wikis.

thanks! will roll the change there as well

Wed, Jul 17, 6:32 AM · Schema-change-in-production, Data-Engineering, DBA

Tue, Jul 16

ABran-WMF added a comment to T365997: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 -lsw1-f2-eqiad .

dbstore1009 has replication up to date on all 3 instances

Tue, Jul 16, 3:30 PM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ABran-WMF added a comment to T370029: cumin2002 db-switchover debug.

We could probably try and rely on read_default_file like we do in other tools

Tue, Jul 16, 1:50 PM · DBA
ABran-WMF updated the task description for T367781: Drop deprecated abuse filter fields on wmf wikis.
Tue, Jul 16, 8:12 AM · Schema-change-in-production, Data-Engineering, DBA
ABran-WMF updated the task description for T367781: Drop deprecated abuse filter fields on wmf wikis.
Tue, Jul 16, 7:24 AM · Schema-change-in-production, Data-Engineering, DBA
ABran-WMF updated the task description for T367781: Drop deprecated abuse filter fields on wmf wikis.
Tue, Jul 16, 7:15 AM · Schema-change-in-production, Data-Engineering, DBA

Mon, Jul 15

ABran-WMF created P66481 testing puppet.
Mon, Jul 15, 1:13 PM
ABran-WMF triaged T368874: Productionize dbproxy102[89] as Medium priority.
Mon, Jul 15, 12:48 PM · Patch-For-Review, DBA
ABran-WMF triaged T370029: cumin2002 db-switchover debug as Medium priority.

debug is already in progress, this task is to track what's been done

Mon, Jul 15, 10:13 AM · DBA
ABran-WMF created T370029: cumin2002 db-switchover debug.
Mon, Jul 15, 10:13 AM · DBA
ABran-WMF added a comment to T362529: Create a Wikimedians of United Arab Emirates User Group Wiki.

@ABran-WMF, @fnegri - I believe that we are ready to run the sre.wikireplicas.add-wiki cookbook for this wiki, which should make it available on both clouddb* and an-redacteddb1001 hosts.
Are you happy for us (myself and @Stevemunene ) to run that now, or is there anything else that either of you feels need be done first?

Mon, Jul 15, 10:12 AM · MW-1.43-notes (1.43.0-wmf.15; 2024-07-23), Data-Services, Patch-For-Review, Wiki-Setup (Create)
ABran-WMF moved T362824: Q#:rack/setup/install dbproxy200[5-8] from Triage to In progress on the DBA board.
Mon, Jul 15, 7:17 AM · DBA, SRE, ops-codfw, Data-Persistence, DC-Ops

Fri, Jul 12

ABran-WMF updated the task description for T367781: Drop deprecated abuse filter fields on wmf wikis.
Fri, Jul 12, 1:27 PM · Schema-change-in-production, Data-Engineering, DBA
ABran-WMF added a comment to T367781: Drop deprecated abuse filter fields on wmf wikis.

execution collided with T367856 on s7, stopped and repooling will resume Monday.

Fri, Jul 12, 9:11 AM · Schema-change-in-production, Data-Engineering, DBA
ABran-WMF added a project to T369855: db1179 crashed - hardware issues: ops-eqiad.

I am unable to reach it via the management interface either; it might need a bit of hands-on work.

Fri, Jul 12, 9:01 AM · SRE, DC-Ops, ops-eqiad, DBA
ABran-WMF changed the status of T369855: db1179 crashed - hardware issues from Open to In Progress.
Fri, Jul 12, 8:51 AM · SRE, DC-Ops, ops-eqiad, DBA

Thu, Jul 11

ABran-WMF awarded T362893: Spicerack support for dbctl a Love token.
Thu, Jul 11, 3:56 PM · DBA, Infrastructure-Foundations, conftool, Spicerack, SRE-tools
ABran-WMF added a comment to T365996: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 - lsw1-f1-eqiad .

dbhost repooling
dbproxy reloaded
backuphost checked and looks green

Thu, Jul 11, 2:47 PM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ABran-WMF awarded T369720: Clean pt-heartbeat from read only external store nodes a Party Time token.
Thu, Jul 11, 10:27 AM · DBA
ABran-WMF added a project to T362529: Create a Wikimedians of United Arab Emirates User Group Wiki: Data-Services.

@Zabe it seems we were missing the "storage layer" task we usually get. Anyway, this is done on our side.

Thu, Jul 11, 7:47 AM · MW-1.43-notes (1.43.0-wmf.15; 2024-07-23), Data-Services, Patch-For-Review, Wiki-Setup (Create)

Wed, Jul 10

ABran-WMF added a comment to T365993: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 - lsw1-e1-eqiad.

db1190 repooling
dbproxy reloaded

Wed, Jul 10, 3:46 PM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ABran-WMF triaged T369720: Clean pt-heartbeat from read only external store nodes as Medium priority.
Wed, Jul 10, 1:57 PM · DBA
ABran-WMF created T369720: Clean pt-heartbeat from read only external store nodes.
Wed, Jul 10, 1:56 PM · DBA
ABran-WMF closed T367278: Migrate mysql icinga alerts to alert manager - pt-heartbeat + scaffolding as Resolved.

This is done, we'll iterate and monitor elsewhere if needed.

Wed, Jul 10, 1:40 PM · Patch-For-Review, DBA
ABran-WMF closed T367278: Migrate mysql icinga alerts to alert manager - pt-heartbeat + scaffolding, a subtask of T315866: Migrate mysql icinga alerts to alert manager, as Resolved.
Wed, Jul 10, 1:39 PM · Patch-For-Review, DBA
ABran-WMF added a comment to T367278: Migrate mysql icinga alerts to alert manager - pt-heartbeat + scaffolding.

The new exporter configuration has been merged and is rolling out to production. The rules have been merged as well.

Wed, Jul 10, 1:37 PM · Patch-For-Review, DBA
ABran-WMF changed the status of T369715: Gather all mariadb host under the same prometheus label from Open to In Progress.
Wed, Jul 10, 1:23 PM · Observability-Metrics, Observability-Alerting, DBA
ABran-WMF placed T369715: Gather all mariadb host under the same prometheus label up for grabs.
Wed, Jul 10, 1:23 PM · Observability-Metrics, Observability-Alerting, DBA
ABran-WMF created T369715: Gather all mariadb host under the same prometheus label.
Wed, Jul 10, 1:22 PM · Observability-Metrics, Observability-Alerting, DBA
ABran-WMF moved T369654: Q1:rack/setup/install db22[21-40] from Triage to Blocked on the DBA board.
Wed, Jul 10, 8:23 AM · DBA, SRE, ops-codfw, Data-Persistence, DC-Ops
ABran-WMF moved T369658: Q1:rack/setup/install pc2017 from Triage to Blocked on the DBA board.
Wed, Jul 10, 8:21 AM · DBA, SRE, Data-Persistence, ops-codfw, DC-Ops
ABran-WMF placed T369661: Q1:rack/setup/install pc1017 up for grabs.
Wed, Jul 10, 8:20 AM · DBA, SRE, Data-Persistence, ops-eqiad, DC-Ops
ABran-WMF claimed T369661: Q1:rack/setup/install pc1017.
Wed, Jul 10, 8:19 AM · DBA, SRE, Data-Persistence, ops-eqiad, DC-Ops

Mon, Jul 8

ABran-WMF closed T367281: Migrate mysql icinga alerts to alert manager - disk pressure as Resolved.
Mon, Jul 8, 1:36 PM · Patch-For-Review, DBA
ABran-WMF closed T367281: Migrate mysql icinga alerts to alert manager - disk pressure, a subtask of T315866: Migrate mysql icinga alerts to alert manager, as Resolved.
Mon, Jul 8, 1:35 PM · Patch-For-Review, DBA
ABran-WMF awarded T368354: Modify db-mysql to connect to an-redacteddb1001 from cumin hosts a Party Time token.
Mon, Jul 8, 9:59 AM · Patch-For-Review, Data-Services, Data-Persistence, Data-Platform-SRE (2024.06.17 - 2024.07.07)
ABran-WMF added a comment to T368354: Modify db-mysql to connect to an-redacteddb1001 from cumin hosts.

we've not seen any regression since you released the update, I think you're good to go!

Mon, Jul 8, 6:37 AM · Patch-For-Review, Data-Services, Data-Persistence, Data-Platform-SRE (2024.06.17 - 2024.07.07)

Fri, Jul 5

ABran-WMF added a comment to T367278: Migrate mysql icinga alerts to alert manager - pt-heartbeat + scaffolding.

We could also go for Misc hosts in codfw as testing.

Fri, Jul 5, 2:15 PM · Patch-For-Review, DBA
ABran-WMF added a comment to T367781: Drop deprecated abuse filter fields on wmf wikis.

will do!

Fri, Jul 5, 12:31 PM · Schema-change-in-production, Data-Engineering, DBA
ABran-WMF claimed T239814: Automate DB upgrades.
Fri, Jul 5, 12:26 PM · User-Ladsgroup, DBA
ABran-WMF updated subscribers of T367278: Migrate mysql icinga alerts to alert manager - pt-heartbeat + scaffolding.

@fgiunchedi FYI, I've started rolling out the backport version on clouddb hosts. I've left aside clouddb1019, as @fnegri told me it was kind of unstable recently. I'll let it sit for the weekend before merging anything, then go for the remaining hosts that need that version installed. @jcrespo don't worry about this change: it's very safe and monitored on my side, and it'll cover backup hosts that have mysqld-exporter installed as well.

Fri, Jul 5, 8:54 AM · Patch-For-Review, DBA

Thu, Jul 4

ABran-WMF renamed T369295: Cookbook to create a sanitarium host from scratch from Cookbook to create a sanitarium master from scratch to Cookbook to create a sanitarium host from scratch.
Thu, Jul 4, 2:44 PM · DBA
fnegri awarded T369295: Cookbook to create a sanitarium host from scratch a Unicorn! token.
Thu, Jul 4, 2:20 PM · DBA
ABran-WMF changed the subtype of T369295: Cookbook to create a sanitarium host from scratch from "Feature Request" to "Task".
Thu, Jul 4, 2:19 PM · DBA
ABran-WMF changed the status of T369295: Cookbook to create a sanitarium host from scratch from Open to In Progress.
Thu, Jul 4, 2:18 PM · DBA
ABran-WMF changed the status of T369295: Cookbook to create a sanitarium host from scratch, a subtask of T362893: Spicerack support for dbctl, from Open to In Progress.
Thu, Jul 4, 2:18 PM · DBA, Infrastructure-Foundations, conftool, Spicerack, SRE-tools
ABran-WMF added a parent task for T369295: Cookbook to create a sanitarium host from scratch: T362893: Spicerack support for dbctl.
Thu, Jul 4, 2:18 PM · DBA
ABran-WMF added a subtask for T362893: Spicerack support for dbctl: T369295: Cookbook to create a sanitarium host from scratch.
Thu, Jul 4, 2:18 PM · DBA, Infrastructure-Foundations, conftool, Spicerack, SRE-tools
ABran-WMF created T369295: Cookbook to create a sanitarium host from scratch.
Thu, Jul 4, 2:17 PM · DBA
ABran-WMF added a comment to T368354: Modify db-mysql to connect to an-redacteddb1001 from cumin hosts.

I'm afraid that's an answer I don't have, @BTullis. Maybe @Marostegui or @Ladsgroup knows.

Thu, Jul 4, 2:11 PM · Patch-For-Review, Data-Services, Data-Persistence, Data-Platform-SRE (2024.06.17 - 2024.07.07)
ABran-WMF added a comment to T368354: Modify db-mysql to connect to an-redacteddb1001 from cumin hosts.

Amazing 🎉 Let's maybe try the first deployment from a canary cumin host so we're 100% sure that there is no breaking change.

Thu, Jul 4, 12:27 PM · Patch-For-Review, Data-Services, Data-Persistence, Data-Platform-SRE (2024.06.17 - 2024.07.07)
ABran-WMF added a comment to T368354: Modify db-mysql to connect to an-redacteddb1001 from cumin hosts.

Ah @BTullis I see the issue you face, I had the same one, sorry for not spotting it sooner!

Thu, Jul 4, 10:06 AM · Patch-For-Review, Data-Services, Data-Persistence, Data-Platform-SRE (2024.06.17 - 2024.07.07)
Ladsgroup awarded T369250: db1213 InnoDB errors a Heartbreak token.
Thu, Jul 4, 8:47 AM · DBA
ABran-WMF triaged T369252: monitoring - MariaDB log parsing and log alerting as Medium priority.
Thu, Jul 4, 7:44 AM · DBA
ABran-WMF moved T369252: monitoring - MariaDB log parsing and log alerting from Triage to Ready on the DBA board.
Thu, Jul 4, 7:29 AM · DBA
ABran-WMF created T369252: monitoring - MariaDB log parsing and log alerting.
Thu, Jul 4, 7:28 AM · DBA
ABran-WMF changed the status of T369250: db1213 InnoDB errors from Open to In Progress.
Thu, Jul 4, 7:17 AM · DBA
ABran-WMF created T369250: db1213 InnoDB errors.
Thu, Jul 4, 7:16 AM · DBA

Wed, Jul 3

ABran-WMF added a comment to T365994: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 - lsw1-e2-eqiad.

db hosts as well, repooling

Wed, Jul 3, 2:25 PM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ABran-WMF updated subscribers of T368354: Modify db-mysql to connect to an-redacteddb1001 from cumin hosts.

@Marostegui yep no worries! @BTullis you can remove and create dgit/$distro-wikimedia branches no problem. Our current packages have their version known and we can always revert to that if needed. The specific instructions are here. @MatthewVernon has produced an extensive doc which I tried to follow and proof check last time I went through it. Please let me know if there is a missing link or component in that chapter that would help. Otherwise I'll just add a header saying that it's OK to have to delete/overwrite dgit/$branch!

Wed, Jul 3, 12:52 PM · Patch-For-Review, Data-Services, Data-Persistence, Data-Platform-SRE (2024.06.17 - 2024.07.07)
ABran-WMF added a comment to T362509: Setup new dbprov hosts and decommission the old ones.

noted, thanks @jcrespo !

Wed, Jul 3, 12:04 PM · Patch-For-Review, database-backups, Data-Persistence-Backup
ABran-WMF claimed T368881: Make sre.mysql.clone handling pooling/depooling.
Wed, Jul 3, 6:18 AM · DBA
ABran-WMF moved T368881: Make sre.mysql.clone handling pooling/depooling from Triage to Ready on the DBA board.
Wed, Jul 3, 6:17 AM · DBA

Tue, Jul 2

ABran-WMF updated subscribers of T369045: Migrate mysql icinga alerts to alert manager - haproxy exporter.
Tue, Jul 2, 1:32 PM · DBA
ABran-WMF added a parent task for T369045: Migrate mysql icinga alerts to alert manager - haproxy exporter: T321808: Port most/all Icinga checks to Prometheus/Alertmanager.
Tue, Jul 2, 1:31 PM · DBA
ABran-WMF added a subtask for T321808: Port most/all Icinga checks to Prometheus/Alertmanager: T369045: Migrate mysql icinga alerts to alert manager - haproxy exporter.
Tue, Jul 2, 1:31 PM · SRE Observability (FY2024/2025-Q1), Observability-Alerting
ABran-WMF updated the task description for T369045: Migrate mysql icinga alerts to alert manager - haproxy exporter.
Tue, Jul 2, 1:31 PM · DBA
ABran-WMF changed the status of T369045: Migrate mysql icinga alerts to alert manager - haproxy exporter from Open to In Progress.
Tue, Jul 2, 1:31 PM · DBA
ABran-WMF created T369045: Migrate mysql icinga alerts to alert manager - haproxy exporter.
Tue, Jul 2, 1:30 PM · DBA
ABran-WMF added a comment to T368098: Dumps generation without prefetch cause disruption to the production environment.

Today and yesterday we had another event of critical alarms firing for that reason

Tue, Jul 2, 12:15 PM · Dumps 2.0, MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), Patch-For-Review, Dumps-Generation, SRE

Mon, Jul 1

ABran-WMF claimed T368874: Productionize dbproxy102[89].
Mon, Jul 1, 1:49 PM · Patch-For-Review, DBA
ABran-WMF claimed T368401: Switchover es6 master (es1037 -> es1038).
Mon, Jul 1, 1:38 PM · DBA
ABran-WMF raised the priority of T362893: Spicerack support for dbctl from Low to Medium.
Mon, Jul 1, 12:55 PM · DBA, Infrastructure-Foundations, conftool, Spicerack, SRE-tools
ABran-WMF added a comment to T367282: Migrate mysql icinga alerts to alert manager - read only status.

I limited myself to warning states on the first PS → I can trigger a page for that context indeed, will iterate

Mon, Jul 1, 8:59 AM · Patch-For-Review, DBA
ABran-WMF added a comment to T368881: Make sre.mysql.clone handling pooling/depooling.

This will be unlocked by T362893

Mon, Jul 1, 8:38 AM · DBA
ABran-WMF moved T362893: Spicerack support for dbctl from Triage to Ready on the DBA board.
Mon, Jul 1, 8:36 AM · DBA, Infrastructure-Foundations, conftool, Spicerack, SRE-tools
ABran-WMF added a project to T362893: Spicerack support for dbctl: DBA.
Mon, Jul 1, 8:36 AM · DBA, Infrastructure-Foundations, conftool, Spicerack, SRE-tools
ABran-WMF added a subtask for T362893: Spicerack support for dbctl: T368881: Make sre.mysql.clone handling pooling/depooling.
Mon, Jul 1, 8:36 AM · DBA, Infrastructure-Foundations, conftool, Spicerack, SRE-tools
ABran-WMF added a parent task for T368881: Make sre.mysql.clone handling pooling/depooling: T362893: Spicerack support for dbctl.
Mon, Jul 1, 8:36 AM · DBA

Thu, Jun 27

ABran-WMF lowered the priority of T367283: Migrate mysql icinga alerts to alert manager - process monitoring from Medium to Low.
Thu, Jun 27, 2:58 PM · Patch-For-Review, DBA