fnegri (Francesco Negri)
Site Reliability Engineer

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Saturday

  • Clear sailing ahead.

User Details

User Since
Jul 18 2022, 2:39 PM (105 w, 3 d)
Availability
Available
IRC Nick
dhinus
LDAP User
FNegri
MediaWiki User
FNegri-WMF [ Global Accounts ]

Recent Activity

Today

fnegri added a comment to T326613: Database credentials for s51347 (fatg) publicly readable on Toolforge.

The code should also just read/parse from replica.my.cnf instead of copying them.

Thu, Jul 25, 4:09 PM · cloud-services-team (FY2023/2024-Q3-Q4), Vuln-Infoleak, SecTeam Discussion, Tools, Security
fnegri closed T326613: Database credentials for s51347 (fatg) publicly readable on Toolforge as Resolved.

The credentials have been rotated.

Thu, Jul 25, 4:05 PM · cloud-services-team (FY2023/2024-Q3-Q4), Vuln-Infoleak, SecTeam Discussion, Tools, Security
fnegri moved T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23 from Backlog to In progress on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Thu, Jul 25, 3:22 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services
fnegri added a comment to T326613: Database credentials for s51347 (fatg) publicly readable on Toolforge.

I don't think these credentials have ever been rotated, replica.my.cnf has a date in 2014 :)

Thu, Jul 25, 3:02 PM · cloud-services-team (FY2023/2024-Q3-Q4), Vuln-Infoleak, SecTeam Discussion, Tools, Security
fnegri closed T368394: MetricsinfraAlertmanagerDown as Resolved.
Thu, Jul 25, 12:47 PM · cloud-services-team
fnegri added a comment to T371011: toolforge: an account don't have db access even though maintain-dbusers claims it has created everything.

Connecting to ToolsDB with the password stored in /data/project/lexica-tool/replica.my.cnf failed with:

ERROR 1045 (28000): Access denied for user 's56035'@'localhost' (using password: YES)
Thu, Jul 25, 12:20 PM · cloud-services-team
fnegri added a comment to T371011: toolforge: an account don't have db access even though maintain-dbusers claims it has created everything.

I can see the user and grants in ToolsDB:

Thu, Jul 25, 12:01 PM · cloud-services-team
valerio.bozzolan awarded T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23 a Fox token.
Thu, Jul 25, 11:34 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services
fnegri added a comment to T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23.

Unfortunately there was one wrong assumption in my calculations above: the number of row lock(s) reported by SHOW ENGINE INNODB STATUS does not match the actual number of rows to be deleted. The number of row locks keeps on increasing, maybe because rows are locked incrementally. I assume the second number (undo log entries) does show the correct number of processed rows, and that has now increased from 1299525 to 1500150. The total number it has to reach is not 1315111 as I incorrectly assumed yesterday, but I believe is the total number of rows in the table:

Thu, Jul 25, 8:50 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services

Yesterday

fnegri moved T291782: Migrate largest ToolsDB users to Trove from Inbox to Epics on the cloud-services-team board.
Wed, Jul 24, 4:15 PM · cloud-services-team, Epic, Data-Services
fnegri edited projects for T291782: Migrate largest ToolsDB users to Trove, added: cloud-services-team; removed cloud-services-team (FY2024/2025-Q1-Q2).
Wed, Jul 24, 4:15 PM · cloud-services-team, Epic, Data-Services
fnegri added a comment to T291782: Migrate largest ToolsDB users to Trove.

This is an Epic that will require multiple quarters. I'm removing the milestone cloud-services-team (FY2024/2025-Q1-Q2) from this task, and adding that milestone to two subtasks of this one that I can realistically start in this quarter:

Wed, Jul 24, 4:14 PM · cloud-services-team, Epic, Data-Services
fnegri moved T369177: [toolsdb] Migrate quickstatements db to Trove from FY2023/2024-Q3-Q4 to FY2024/2025-Q1-Q2 on the cloud-services-team board.
Wed, Jul 24, 4:11 PM · cloud-services-team (FY2024/2025-Q1-Q2), Data-Services, Goal
fnegri moved T350862: [toolsdb] Migrate mixnmatch db to Trove from FY2023/2024-Q3-Q4 to FY2024/2025-Q1-Q2 on the cloud-services-team board.
Wed, Jul 24, 4:11 PM · cloud-services-team (FY2024/2025-Q1-Q2), Goal, Data-Services
fnegri edited projects for T326613: Database credentials for s51347 (fatg) publicly readable on Toolforge, added: cloud-services-team (FY2023/2024-Q3-Q4); removed cloud-services-team.
Wed, Jul 24, 4:09 PM · cloud-services-team (FY2023/2024-Q3-Q4), Vuln-Infoleak, SecTeam Discussion, Tools, Security
fnegri edited projects for T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23, added: cloud-services-team (FY2023/2024-Q3-Q4); removed cloud-services-team.
Wed, Jul 24, 4:09 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services
fnegri added a comment to T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23.

Update: I think I found a way to estimate how long the transaction will take, thanks to this brilliant StackOverflow post.

Wed, Jul 24, 4:04 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services
fnegri triaged T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23 as High priority.

Replication is still stuck processing the same DELETE transaction. From past experience, it usually takes a few days to complete the transaction and catch up. I haven't yet found a way to estimate the time it's gonna take, as that can vary depending on the size of the transaction.

Wed, Jul 24, 3:28 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services
fnegri closed T291092: Update operational documentation for NFS and wikireplicas as Invalid.

Wiki Replicas docs improvements are tracked in this newer task: T365717: [wikireplicas] Update Admin docs

Wed, Jul 24, 2:55 PM · cloud-services-team, Documentation
fnegri renamed T306455: [toolsdb] convert myisam tables to innodb from toolsdb: reduce number of myisam tables to toolsdb: convert myisam tables to innodb.
Wed, Jul 24, 2:52 PM · Epic, cloud-services-team, Data-Services
fnegri renamed T306455: [toolsdb] convert myisam tables to innodb from toolsdb: convert myisam tables to innodb to [toolsdb] convert myisam tables to innodb.
Wed, Jul 24, 2:49 PM · Epic, cloud-services-team, Data-Services
fnegri closed T324629: toolsdb: What happens at Midnight? as Invalid.

This is not happening anymore.

Wed, Jul 24, 2:49 PM · cloud-services-team, Data-Services
fnegri added a project to T293804: [NFS] Add monitoring and alerting to the new NFS system: Toolforge.
Wed, Jul 24, 2:41 PM · Toolforge, cloud-services-team
fnegri added a comment to T306565: Forward ceph/octopus packages to Bullseye repo.

Upgrade to Ceph v16 is tracked in T306820: [ceph] Upgrade to v16

Wed, Jul 24, 2:28 PM · cloud-services-team
fnegri added a project to T320266: figure out deterministic way to tell if a rabbitmq cluster is paritioned: Cloud-VPS.
Wed, Jul 24, 2:21 PM · Cloud-VPS, cloud-services-team

Tue, Jul 23

fnegri added a comment to T367778: [wikireplicas] frequent replag spikes in clouddb hosts.

Replication lag on clouddb1019 (s4) remained at 0 until 11:25 UTC today, then it started increasing again.

Tue, Jul 23, 3:56 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri added a comment to T370732: PowerSupplyFailure .

The alert went back to green 1 minute after posting the comment above :)

Tue, Jul 23, 3:12 PM · SRE, DC-Ops, ops-codfw, cloud-services-team
fnegri added a project to T370732: PowerSupplyFailure : ops-codfw.

The same error happened last month (T368211), and was fixed by @Jhancock.wm by reseating the cable.

Tue, Jul 23, 3:08 PM · SRE, DC-Ops, ops-codfw, cloud-services-team
fnegri created P66893 (An Untitled Masterwork).
Tue, Jul 23, 11:15 AM
fnegri updated the task description for T357624: [toolsdb] Replica is frequently lagging behind the primary.
Tue, Jul 23, 10:33 AM · Data-Services
fnegri updated the task description for T357624: [toolsdb] Replica is frequently lagging behind the primary.
Tue, Jul 23, 10:33 AM · Data-Services
fnegri updated the task description for T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23.
Tue, Jul 23, 10:32 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services
fnegri added a comment to T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23.

I acked the alert for 24 hours, hopefully it will catch up by then.

Tue, Jul 23, 10:32 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services
fnegri changed the status of T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23 from Open to In Progress.
Tue, Jul 23, 10:30 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services
fnegri changed the status of T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23, a subtask of T357624: [toolsdb] Replica is frequently lagging behind the primary, from Open to In Progress.
Tue, Jul 23, 10:28 AM · Data-Services
fnegri created T370760: [toolsdb] ToolsToolsDBReplicationLagIsTooHigh - 2024-07-23.
Tue, Jul 23, 10:28 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services

Mon, Jul 22

fnegri updated subscribers of T367778: [wikireplicas] frequent replag spikes in clouddb hosts.

As an additional test, I killed all the remaining long queries for user s52168 (tools.kmlexport), and 2 long queries for user u1115 (@dschwen):

Mon, Jul 22, 5:26 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri attached a referenced file: F56600236: Screenshot 2024-07-22 at 19.11.32.png.
Mon, Jul 22, 5:25 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri added a comment to T367778: [wikireplicas] frequent replag spikes in clouddb hosts.

I spent some more time analyzing long queries running on clouddb1019. I found a few long queries for user s52168 (tools.kmlexport), and killing the 3 oldest ones did indeed cause the replication lag to stop growing and to start going down:

Mon, Jul 22, 5:11 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri updated the task description for T370652: tofu-infra: introduce additional gitlab-ci automation.
Mon, Jul 22, 3:36 PM · Cloud-VPS, User-aborrero, Epic, cloud-services-team
fnegri moved T297083: [ceph] Getting rack level HA from Backlog to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Mon, Jul 22, 10:06 AM · cloud-services-team (FY2023/2024-Q3-Q4), Goal, DC-Ops, Cloud-Services-Worktype-Project, Cloud-Services-Origin-Team

Fri, Jul 19

fnegri added a comment to T370414: tofu-infra: create a cookbook automation to run tofu.

It seems tofu itself is protected against this. We may not need any lock in the cookbooks after all.

Fri, Jul 19, 12:00 PM · Cloud-VPS, User-aborrero, Epic, cloud-services-team
fnegri closed T370498: SystemdUnitDown Unit opentofu-infra-diff.service on node cloudcontrol1007 has been down for long. as Resolved.

Seems to be working now:

Fri, Jul 19, 9:45 AM · cloud-services-team
fnegri added a comment to T370414: tofu-infra: create a cookbook automation to run tofu.

Do we make use of the mentioned etcd backend from cloudcumin hosts?

Fri, Jul 19, 9:21 AM · Cloud-VPS, User-aborrero, Epic, cloud-services-team

Thu, Jul 18

fnegri added a comment to T367778: [wikireplicas] frequent replag spikes in clouddb hosts.

Back from my holidays, here's a glance of what happened since my last comment:

Thu, Jul 18, 5:18 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri added a comment to T314665: Toolforge: Replace all bastion with grid-less bookworm based bastion hosts.

I've added T360488: Missing Perl packages on dev.toolforge.org for anomiebot workflows as a subtask to remember that anomiebot is currently relying on the login-buster bastion, and if we remove it that tool is likely to break.

Thu, Jul 18, 4:23 PM · Toolforge (Toolforge iteration 13), User-dcaro, Patch-For-Review, cloud-services-team
fnegri added a subtask for T314665: Toolforge: Replace all bastion with grid-less bookworm based bastion hosts: T360488: Missing Perl packages on dev.toolforge.org for anomiebot workflows.
Thu, Jul 18, 4:20 PM · Toolforge (Toolforge iteration 13), User-dcaro, Patch-For-Review, cloud-services-team
fnegri added a parent task for T360488: Missing Perl packages on dev.toolforge.org for anomiebot workflows: T314665: Toolforge: Replace all bastion with grid-less bookworm based bastion hosts.
Thu, Jul 18, 4:20 PM · Toolforge, cloud-services-team
fnegri awarded Blog Post: Iterative Improvements a Love token.
Thu, Jul 18, 1:26 PM
fnegri renamed T370037: Cloud VPS: extend tofu-infra coverage from Cloud VPS: consider extending tofu-infra coverage to Cloud VPS: extend tofu-infra coverage.
Thu, Jul 18, 1:24 PM · Patch-For-Review, Cloud-VPS, User-aborrero, Epic, cloud-services-team
fnegri added a comment to T370414: tofu-infra: create a cookbook automation to run tofu.

We should also make use of Spicerack's locking functionality to prevent two people from running apply at the same time:
https://doc.wikimedia.org/spicerack/master/introduction.html#distributed-locking

Thu, Jul 18, 12:57 PM · Cloud-VPS, User-aborrero, Epic, cloud-services-team
fnegri added a comment to T370414: tofu-infra: create a cookbook automation to run tofu.

I would maybe force --branch main if you select --apply (or at least require --force to apply a different branch)

Thu, Jul 18, 12:55 PM · Cloud-VPS, User-aborrero, Epic, cloud-services-team

Wed, Jul 17

fnegri added a project to T370037: Cloud VPS: extend tofu-infra coverage: Cloud-VPS.
Wed, Jul 17, 2:49 PM · Patch-For-Review, Cloud-VPS, User-aborrero, Epic, cloud-services-team
fnegri added a comment to T179816: Cumin: create external backend for WMCS Puppet API.

to allow to query hosts also by their Puppet classes.

Wed, Jul 17, 2:22 PM · Cloud-VPS, Cumin, cloud-services-team, Infrastructure-Foundations
fnegri added a comment to T362629: Allow interacting with Toolforge PuppetDB from wmcs-cookbooks.

I just discovered that we do have dedicated Cumin instances for tools and toolsbeta, where /etc/cumin/config.yaml points to the respective puppetdb instead of the production one:

Wed, Jul 17, 1:44 PM · Cumin, cloud-services-team, Infrastructure-Foundations, Toolforge

Tue, Jul 16

fnegri added a comment to T362529: Create a Wikimedians of United Arab Emirates User Group Wiki.

@ABran-WMF, @fnegri - I believe that we are ready to run the sre.wikireplicas.add-wiki cookbook for this wiki, which should make it available on both clouddb* and an-redacteddb1001 hosts.

Tue, Jul 16, 2:11 PM · MW-1.43-notes (1.43.0-wmf.15; 2024-07-23), Data-Services, Patch-For-Review, Wiki-Setup (Create)
fnegri added a comment to T370127: Request new flavor for integration project.

+1

Tue, Jul 16, 2:02 PM · Cloud-VPS (Quota-requests)
fnegri moved T369172: toolforge: upgrade control plane nodes to k8s 1.25 from Next Up to Done on the Toolforge (Toolforge iteration 12) board.
Tue, Jul 16, 1:46 PM · Toolforge (Toolforge iteration 12), User-aborrero, cloud-services-team
fnegri moved T369171: toolforge: upgrade worker nodes to k8s 1.25 from Next Up to In Progress on the Toolforge (Toolforge iteration 12) board.
Tue, Jul 16, 1:46 PM · Toolforge (Toolforge iteration 12), User-aborrero, cloud-services-team
fnegri added a comment to T365424: Upgrade clouddb* hosts to Bookworm.

I think it makes sense to do your test first. I can change back the role before the reimage.

Tue, Jul 16, 10:43 AM · cloud-services-team (FY2024/2025-Q1-Q2), Data-Persistence, Data-Services
fnegri added a comment to T365424: Upgrade clouddb* hosts to Bookworm.

@BTullis I think you can proceed with your test and turn off all sections for a week. When that is done and you are confident nothing goes wrong as a result, I will proceed with the reimage. After the reimage is done, we can decommission it.

Tue, Jul 16, 10:38 AM · cloud-services-team (FY2024/2025-Q1-Q2), Data-Persistence, Data-Services
fnegri added a parent task for T365424: Upgrade clouddb* hosts to Bookworm: T368518: decommission clouddb1021.
Tue, Jul 16, 10:36 AM · cloud-services-team (FY2024/2025-Q1-Q2), Data-Persistence, Data-Services
fnegri added a subtask for T368518: decommission clouddb1021: T365424: Upgrade clouddb* hosts to Bookworm.
Tue, Jul 16, 10:36 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28), decommission-hardware
fnegri added a comment to T315866: Migrate mysql icinga alerts to alert manager.

Adding a note to mention that currently Icinga alerts related to clouddb* hosts are getting tagged with team=wmcs when they are forwarded from icinga to alertmanager. I tried to figure out where that tagging happens but I haven't found it. We should aim to maintain that tagging when the alerts are migrated to alertmanager.

Tue, Jul 16, 10:26 AM · Patch-For-Review, DBA
fnegri moved T369723: Update all trove VMs to a modern guest image from Unsorted to Trove (DBaaS) on the Cloud-VPS board.
Tue, Jul 16, 9:50 AM · Goal, Data-Services, Cloud-VPS, cloud-services-team, User-Marostegui

Mon, Jul 15

fnegri added a project to T179816: Cumin: create external backend for WMCS Puppet API: Cloud-VPS.
Mon, Jul 15, 2:41 PM · Cloud-VPS, Cumin, cloud-services-team, Infrastructure-Foundations
fnegri closed T280152: Mitigate breaking changes from the new Wiki Replicas architecture as Resolved.

Marking this as Resolved as all the main subtasks have been completed. There are 4 subtasks left that are follow-ups to this work.

Mon, Jul 15, 9:59 AM · Data-Engineering-Icebox, cloud-services-team, Analytics-Radar, Data-Services
fnegri closed T280152: Mitigate breaking changes from the new Wiki Replicas architecture, a subtask of T215858: Plan a replacement for wiki replicas that is better suited to typical OLAP use cases than the MediaWiki OLTP schema, as Resolved.
Mon, Jul 15, 9:55 AM · Epic, cloud-services-team, Data-Engineering, Data-Services
fnegri removed projects from T298452: Some wikibase tables not available in commonswiki_p: cloud-services-team, Data-Services.
Mon, Jul 15, 9:51 AM · Data-Platform-SRE (2024.07.08 - 2024.07.28), Data-Engineering
fnegri removed a project from T217647: Table field templatelink.tl_from in database does not always match page.page_namespace: Data-Services.

I'm removing the Data-Services tag as this is not a problem with wikireplicas.

Mon, Jul 15, 9:48 AM · Wikimedia-database-issue (Bad data), MediaWiki-Page-derived-data
fnegri triaged T251801: Query is too slow ever since the migration to actor table as Low priority.
Mon, Jul 15, 9:43 AM · Data-Services
fnegri triaged T252356: Aggregate query on page, revision and langlinks takes a long time to run as Low priority.
Mon, Jul 15, 9:43 AM · Data-Services
fnegri triaged T252122: Optimize querying the page table by namespace as Low priority.
Mon, Jul 15, 9:42 AM · Data-Services
fnegri triaged T284948: Raw IPs of logged-out users disclosed in wiki-replicas as Low priority.

I am fine with having this ticket stalled until IP masking (T283177) is effective,

Mon, Jul 15, 9:34 AM · cloud-services-team, Data-Engineering, Privacy Engineering, Data-Services
fnegri added a subtask for T284948: Raw IPs of logged-out users disclosed in wiki-replicas: T324492: Temporary accounts - MVP.
Mon, Jul 15, 9:30 AM · cloud-services-team, Data-Engineering, Privacy Engineering, Data-Services
fnegri added a parent task for T324492: Temporary accounts - MVP: T284948: Raw IPs of logged-out users disclosed in wiki-replicas.
Mon, Jul 15, 9:30 AM · Epic, Temporary accounts
fnegri triaged T286328: Simple query scans entire revision table on new replicas as Low priority.
Mon, Jul 15, 9:26 AM · Data-Services
fnegri triaged T297026: Automate maintain-views workflow as Medium priority.
Mon, Jul 15, 9:17 AM · cloud-services-team, Patch-For-Review, Data-Services
fnegri triaged T344108: Add global_edit_count to wikireplicas as Medium priority.
Mon, Jul 15, 9:13 AM · Data-Platform, cloud-services-team, Patch-For-Review, Data-Services

Thu, Jul 4

fnegri triaged T369308: Decommission clouddb2002-dev.codfw.wmnet as Low priority.
Thu, Jul 4, 4:31 PM · Data-Persistence, cloud-services-team, Cloud-VPS
fnegri created T369308: Decommission clouddb2002-dev.codfw.wmnet.
Thu, Jul 4, 4:25 PM · Data-Persistence, cloud-services-team, Cloud-VPS
fnegri added a comment to T367393: Allow Superset to query ToolsDB public databases.

@KCVelaga_WMF thanks for testing! Yes it is kind of expected that all dbs are exposed, as in the SQL permissions are for all dbs ending in _p. So maybe to solve the problem of manually listing the dbs, we could just call the db "ToolsDB" in Superset, and ask users to start all their queries with use some_db_p;. What do you think?

Thu, Jul 4, 4:00 PM · cloud-services-team (FY2023/2024-Q3-Q4), superset.wmcloud.org
fnegri updated subscribers of T367393: Allow Superset to query ToolsDB public databases.

@Andrew @rook can you please review the PR at https://github.com/toolforge/superset-deploy/pull/26 and test if it works?

Thu, Jul 4, 2:24 PM · cloud-services-team (FY2023/2024-Q3-Q4), superset.wmcloud.org
fnegri awarded T369295: Cookbook to create a sanitarium host from scratch a Unicorn! token.
Thu, Jul 4, 2:20 PM · DBA
fnegri added a comment to T358774: [wmcs-backup] Backup snapshots of deleted volumes are never cleaned up.

Setting this to "Stalled" as I have too many things on my plate and this is not super urgent. I'd like to get back to it at some point but feel free to claim it if you are interested in this.

Thu, Jul 4, 2:16 PM · Cloud-VPS, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri changed the status of T358774: [wmcs-backup] Backup snapshots of deleted volumes are never cleaned up from In Progress to Stalled.
Thu, Jul 4, 2:16 PM · Cloud-VPS, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri moved T348407: Allow Quarry to query ToolsDB public databases from In progress to Done on the cloud-services-team (FY2023/2024-Q3-Q4) board.
Thu, Jul 4, 2:15 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry
fnegri claimed T344599: wikireplicas root access.
Thu, Jul 4, 11:50 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Infrastructure Security

Wed, Jul 3

fnegri added a comment to T348407: Allow Quarry to query ToolsDB public databases.

Additional clean-up: I removed the grant for heartbeat_p as that is already implied in the grant for %\_p.

Wed, Jul 3, 4:58 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Quarry
fnegri added a comment to T367393: Allow Superset to query ToolsDB public databases.

I created the user and grants in ToolsDB, similar to the Quarry ones I created in T348407.

Wed, Jul 3, 4:57 PM · cloud-services-team (FY2023/2024-Q3-Q4), superset.wmcloud.org
fnegri added a comment to T367778: [wikireplicas] frequent replag spikes in clouddb hosts.

Two days later, it's still not looking great:

Wed, Jul 3, 4:36 PM · Data-Services, cloud-services-team (FY2023/2024-Q3-Q4)
fnegri added a project to T350862: [toolsdb] Migrate mixnmatch db to Trove: Goal.
Wed, Jul 3, 2:43 PM · cloud-services-team (FY2024/2025-Q1-Q2), Goal, Data-Services
fnegri removed a project from T291782: Migrate largest ToolsDB users to Trove: Goal.
Wed, Jul 3, 2:43 PM · cloud-services-team, Epic, Data-Services
fnegri triaged T301967: toolsdb: evaluate storage usage by some tools as Low priority.

DB storage is tracked in the subtask T291782: Migrate largest ToolsDB users to Trove

Wed, Jul 3, 2:38 PM · cloud-services-team, Toolforge, Data-Services
fnegri triaged T369177: [toolsdb] Migrate quickstatements db to Trove as Medium priority.
Wed, Jul 3, 2:35 PM · cloud-services-team (FY2024/2025-Q1-Q2), Data-Services, Goal
fnegri created T369177: [toolsdb] Migrate quickstatements db to Trove.
Wed, Jul 3, 2:21 PM · cloud-services-team (FY2024/2025-Q1-Q2), Data-Services, Goal
fnegri closed T368136: [wikireplicas] Make sure there is no sensitive data in clouddb hosts as Declined.

I highly doubt it'd be possible honestly for everything.

Wed, Jul 3, 1:50 PM · SRE, Data-Services, cloud-services-team
fnegri closed T368136: [wikireplicas] Make sure there is no sensitive data in clouddb hosts, a subtask of T344599: wikireplicas root access, as Declined.
Wed, Jul 3, 1:50 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Services, Infrastructure Security
fnegri added a comment to T344877: SQL function to recover the normal hostname, to install on Wiki Replica instances.

I don't know if MariaDB could allow a function being defined but not used in WHERE (and only in SELECT for example)

Wed, Jul 3, 1:40 PM · Data-Services

Tue, Jul 2

fnegri closed T368669: puppet-diffs quota request for buster migration as Resolved.

Increased quotas by 5 cores, 20 gigabytes, 1 instances, 12288 ram

Tue, Jul 2, 5:01 PM · Cloud-VPS (Quota-requests)