2024-11-07 - Green Software Playbooks agenda #18
Comments
We want to differentiate between improvements to existing projects and creating a new project or adding features.
Write a first draft for the following best practice idea: identification and classification of all existing data (including data in sandboxes, old archives, test data, ...) -> deletion of all identified ROT (redundant, outdated, trivial) and otherwise unnecessary data (Franziska)
I will write some instructions for keeping data storage co-located with processing and retrieval (in other words, use the data centers nearest to where the data is evaluated) @moin-oss
First idea for instructions for the best practice "Identification and classification of all existing data -> deletion of all identified unnecessary data":
Green Software Playbooks – Data Engineering
Improvements to existing projects
Identify and classify all existing data and later delete all identified unnecessary data
Go over all available data storage in your project and document and classify existing data.
Permanently delete all identified ROT and unneeded data from your data storage. If necessary adjust the corresponding loads to avoid the creation of new ROT data.
Green IT Advantages: This helps you reduce the amount of data in your project and helps you avoid the existence of dark data. About 10% to 1/3 of the energy in data centres is used for data storage, so you can reduce your CO2 footprint, energy consumption and also cost by reducing the amount of data stored.
Considerations during setup of a new project
Identify and classify all existing data
Establish a process from the start to keep track of all the data in any data storage of your new project, e.g. by having an analysis task which will run regularly and go through all your stored data.
Green IT Advantages: This helps you reduce the amount of data in your project and helps you avoid the existence of dark data. About 10% to 1/3 of the energy in data centres is used for data storage, so you can reduce your CO2 footprint, energy consumption and also cost by reducing the amount of data stored.
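A possible starting point for the regular analysis task mentioned in this draft, as a minimal Python sketch. It assumes the data lives in a plain file tree; the function name `classify_files`, the labels, and both thresholds are hypothetical and not part of the draft. It only reports candidates and deletes nothing, so that a person can review the list first.

```python
import hashlib
import time
from pathlib import Path

# Hypothetical thresholds, chosen only for illustration; tune per project.
OUTDATED_AFTER_DAYS = 365
TRIVIAL_MAX_BYTES = 0  # treat only empty files as trivial


def file_digest(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files are not loaded into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def classify_files(root, now=None):
    """Return {path: label} for every file under root.

    Labels: 'trivial' (size at or below TRIVIAL_MAX_BYTES), 'redundant'
    (same content as a file seen earlier in the scan), 'outdated' (not
    modified for more than OUTDATED_AFTER_DAYS), otherwise 'keep'.
    """
    now = time.time() if now is None else now
    seen_digests = set()
    labels = {}
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        stat = path.stat()
        digest = file_digest(path)
        if stat.st_size <= TRIVIAL_MAX_BYTES:
            label = "trivial"
        elif digest in seen_digests:
            label = "redundant"
        elif now - stat.st_mtime > OUTDATED_AFTER_DAYS * 86400:
            label = "outdated"
        else:
            label = "keep"
        seen_digests.add(digest)
        labels[str(path)] = label
    return labels
```

Running `classify_files("/path/to/data")` on a schedule gives a report of ROT candidates. Databases and object stores would need their own inventory queries, which this sketch does not cover.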
Draft set of directions for developers to look at the locations of their databases and processing servers:
Green Software Playbooks – Data Engineering
Improvements to existing projects
Using servers for processing in the same data center as the databases
Examine the existing arrangement of your project to determine whether the databases are in the same data center as the servers that process the data.
Once the servers and the databases are set up to run in the same data center, no further modifications should be needed.
Green IT Advantages: As of 2017, the energy rate for transferring data over the internet amounted to 1.8 kWh/GB (https://www.wholegraindigital.com/blog/website-energy-consumption/). Given the often large quantities of data processed by event-driven or data-driven applications, transferring data from databases to servers for processing can require a significant amount of energy. By keeping databases and servers within the same data center, the energy demand for data processing can be significantly reduced.
Considerations during setup of a new project
Using servers for processing in the same data center as the databases
When creating a new project, the easiest solution is to use the same cloud provider for both databases and servers so that both can run in the same data center. Otherwise, it will be important to discuss with prospective providers the locations of their respective data centers so that the databases and servers are not too far apart.
Green IT Advantages: As of 2017, the energy rate for transferring data over the internet amounted to 1.8 kWh/GB (https://www.wholegraindigital.com/blog/website-energy-consumption/). Given the often large quantities of data processed by event-driven or data-driven applications, transferring data from databases to servers for processing can require a significant amount of energy. By keeping databases and servers within the same data center, the energy demand for data processing can be significantly reduced.
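To make the examination step concrete, here is a rough Python sketch that flags server/database pairs located in different data centers and estimates the transfer energy using the 1.8 kWh/GB figure quoted in this draft. The function names, the service names, and the location labels are hypothetical. The figure itself is a 2017 estimate for internet transfer, so the result is an order-of-magnitude indication at best.

```python
# Figure quoted in the draft (2017 estimate for transfer over the internet).
KWH_PER_GB = 1.8


def transfer_energy_kwh(gigabytes, kwh_per_gb=KWH_PER_GB):
    """Estimate the energy needed to move `gigabytes` between sites."""
    return gigabytes * kwh_per_gb


def find_cross_site_links(links, locations):
    """Return the (server, database) pairs whose data centers differ.

    links:     list of (server_name, database_name) pairs
    locations: mapping of service name -> data center label
    """
    return [
        (server, database)
        for server, database in links
        if locations[server] != locations[database]
    ]


# Hypothetical inventory, for illustration only.
locations = {
    "etl-worker": "eu-central",
    "orders-db": "eu-central",
    "report-api": "us-east",
}
links = [("etl-worker", "orders-db"), ("report-api", "orders-db")]

for server, database in find_cross_site_links(links, locations):
    print(f"{server} and {database} run in different data centers")
```

Multiplying the monthly volume on each flagged link by `transfer_energy_kwh` gives a first estimate of what co-locating the pair could save, under the stated assumption about the energy rate.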
Date
2024-11-07 - HH:MM UTC - See the time in your timezone https://everytimezone.com
Roll Call
Please add a comment to this issue during the meeting to denote attendance.
Any untracked attendees will be added by the GSF team below:
Previous Meeting
Notes from the previous meeting: ...
Agenda
OKR & KPI updates
Any Other Business