Releases: AliceO2Group/Control
v1.27.0
This release includes several fixes and improvements to error reporting, as well as the propagation of pdp_beam_type to ODC tasks.
-
Fixes:
- [core] protect from accessing nil t.Task
- [core] fixing and refactoring monitoring
- [core] fix nil task access
-
Error reporting:
- [core] improvements in error reporting during env deployment and configuration
- [core] do not warn about tasks not in roster when it's expected
- [core][executor] fine-tuning operator logs
- OCTRL-951 [core][executor] improve reporting error transition
- OCTRL-900 [core] demote getConfig warnings containing debug information
- OCTRL-759 [core] clearer deployment failure logs for OPS
-
ODC tasks communication:
- [core] propagate pdp_beam_type as FairMQ property to ODC tasks
-
Documentation:
- Update operation_order.md with trg.RunStop change
v1.26.0
This release includes a new metrics publishing endpoint, a new sort function in the workflow template system to sort JSON lists, and miscellaneous fixes.
-
Metrics:
- [core] Adding http metrics endpoint
- fixup! [core] Adding http metrics endpoint
-
JSON list sorting:
- [core] Add json.Sort function to workflow template context
- [core] Sort detector list before including in ODC payload
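The idea behind sorting a JSON list before it goes into a payload can be sketched as follows. This is a minimal example, assuming the list is a JSON array of strings; `sortJSONList` is a hypothetical helper written for illustration and is not the `json.Sort` function exposed in the workflow template context, whose exact signature is not given in these notes.

```go
package main

import (
	"encoding/json"
	"fmt"
	"sort"
)

// sortJSONList is a hypothetical helper: it parses a JSON array of strings,
// sorts it lexicographically, and serializes it back to JSON.
func sortJSONList(in string) (string, error) {
	var items []string
	if err := json.Unmarshal([]byte(in), &items); err != nil {
		return "", fmt.Errorf("not a JSON list of strings: %w", err)
	}
	sort.Strings(items)
	out, err := json.Marshal(items)
	if err != nil {
		return "", err
	}
	return string(out), nil
}

func main() {
	// Example detector list; the order in the input is arbitrary.
	sorted, err := sortJSONList(`["TPC","ITS","MFT"]`)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(sorted) // prints ["ITS","MFT","TPC"]
}
```

Sorting before serialization makes the resulting payload deterministic, so two environments with the same detectors produce the same list regardless of the order in which the detectors were selected.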
-
Miscellaneous:
- [build] Fix make fdset command to include common.proto
- [occ] do not call iterateCheck when in ERROR
- Bump golang.org/x/crypto from 0.21.0 to 0.31.0
v1.25.0
This release includes fixes to events emitted to Kafka, improved handling of controlled nodes becoming unreachable, and documentation improvements.
-
Events:
- [core] Report to Kafka source state if transition failed
- [core] Handle gRPC code DeadlineExceeded in DCS client
-
Handling of unreachable nodes:
- OCTRL-949 [core] Improve reaction to controlled nodes becoming unreachable
-
Documentation:
- [build] Fix protoc call for o2control.proto docs generation
- [build] Regenerate apidocs
- [coconut] Improve role query command documentation
v1.24.0
This release moves FairMQ sockets into the abstract namespace to avoid polluting /tmp, and fixes an issue where the core could get stuck.
-
FairMQ abstract namespace:
- [core] Pass to FairMQ tasks abstract namespace endpoint paths
-
Core stuck fix:
- [core] avoid stuck updateTaskStatus due to multiple mesos updates
v1.23.2
v1.23.1
v1.23.0
This release improves timeout handling in the DD scheduler and TRG clients.
-
DD scheduler:
- [core] Increase default DD scheduler gRPC timeout
- [core] Make new ddsched timeout only apply to GetData status calls
- [core] Remove RPC interceptor
-
TRG:
- [core] Ensure TRG timeouts are obeyed + add polling timeout
v1.22.1
v1.22.0
This release includes a fix for a deployment failure, and improves communication with the DD scheduler by adding a timeout.
-
Deployment failure:
- fix error handling in case of no resources error
- Revert "use two channel to communicate mesos REVIVE"
-
DD scheduler timeout:
- [core] ddsched plugin gRPC calls have timeout by default