Settings for experiments 116379752

Started: Fri Dec 5 19:44:36 2025 -- up 1 hr 24 min 18 sec
Built on Dec 5 2025 16:11:08 (1764979868)
Built at rbex-enqueue-targets@oqs20.prod.google.com:
Built as //cloud/ai/platform/dataplane/cardolan:vertex-genai-dataplane
Build label: cloud-ml.vertex-genai-dataplane_20251205.04_p0
Built target blaze-out/k8-opt/bin/cloud/ai/platform/dataplane/cardolan/vertex-genai-dataplane
Build options: fdo=XFDO
Built for gcc-4.X.Y-crosstool-v18-llvm-grtev4-k8.k8
Built from changelist 840844128 with baseline 840844128 in a mint client based on //depot/google3
Task BNS: /bns/yudfwra/borg/yudfwra/bns/cloud-ml-vertex-genai-dataplane-staging-jobs/staging-qual-us.vertex-genai-dataplane/1
Ports:
  esf.1.staging-qual-us.vertex-genai-dataplane.cloud-ml-vertex-genai-dataplane-staging-jobs.yudfwra.borg.google.com:8111
  1.staging-qual-us.vertex-genai-dataplane.cloud-ml-vertex-genai-dataplane-staging-jobs.yudfwra.borg.google.com:25948
  borgenvelope.1.staging-qual-us.vertex-genai-dataplane.cloud-ml-vertex-genai-dataplane-staging-jobs.yudfwra.borg.google.com:25949
Profiling Links:
  censusprofilez?seconds=30 (CPU profile with go/census tags, 30 seconds): esf:8111 25948 borgenvelope:25949
  censusheapz (heap usage with go/census tags): esf:8111 25948 borgenvelope:25949
  peakheapz (peak heap usage): esf:8111 25948 borgenvelope:25949
  deltacontentionz?seconds=10 (contention, 10 seconds): esf:8111 25948 borgenvelope:25949
  threadz (thread stacks): esf:8111 25948 borgenvelope:25949
  mmapz (mmap() usage): esf:8111 25948 borgenvelope:25949
  contentionz (legacy contention): esf:8111 25948 borgenvelope:25949
Active RPC Experiments:
  Stubby: stubby_default_subsetting, go/inline-psp, 1rpc
  gRPC: channelz_use_v2_for_v1_api, channelz_use_v2_for_v1_service, channelz_zviz, chttp2_bound_write_size, deprecate_keep_grpc_initialized, error_flatten, event_engine_channelz_socket_info, event_engine_client, event_engine_dns, event_engine_dns_non_client_channel, event_engine_listener, event_engine_callback_cq, event_engine_secure_endpoint, google_no_envelope_resolver, graceful_external_connection_failure, lbns_support_in_address_resolver, loas2_protect_memory_optimization, max_inflight_pings_strict_limit, monitoring_experiment, namecheck_core_lib, privacy_context_single_encoding, prod2cloud_w3c_trace, rr_wrr_connect_from_random_index, tsi_frame_protector_without_locks

Running on yudfwra-ca2.prod.google.com
Platform: arcadia milan
Process size: 13433MiB
Memory usage: 2076MiB
Load avg (1m): 50.86
Name: GlobalDagRouting_RetriesConfig_Launch::Experiment::Dev
Experiment group: LAUNCH
Owner email: kevspace@google.com,luyaogong@google.com,tiancong@google.com
Aggregate experiment id: 116379750
Description: Dev partition
mendel source: http://google3/googledata/experiments/vertex_ai/dataplane_prediction_common/studies/GlobalDagRouting_RetriesConfig_Launch.gcl
Query proportion: 0.00%
Experiment layer: GlobalDagRoutingRetriesConfigLaunchLayer
Control experiment id: 116379725
Phase: NORMAL
Diversion key: 2
Status: ACTIVE
Managed traffic allocation: NOT_MANAGED
End date: never
Condition index: 0
GlobalDagRouting__retries_config: 'name: "GlobalDagRouting__retries_config"
type: PROTO_BINARY_BASE64
sub_type: "cloud_ai_platform_dataplane_prediction_proto.CardolanRetriesConfig"
base_value: "config_id: \"ge_retries_config\"\nmodel_ids: \"gemini-2.0-flash-001\"\nmodel_ids: \"gemini-2.0-flash-lite-001\"\nmodel_ids: \"gemini-2.5-pro-preview\"\nmodel_ids: \"gemini-2.0-flash-preview\"\nmodel_ids: \"gemini-[2-5]\\\\..*\"\nrequest_types: \"dedicated-critical_plus\"\nrequest_types: \"shared-critical\"\nrequest_types: \"shared-sheddable_plus\"\nerror_codes: RESOURCE_EXHAUSTED\nerror_codes: UNAVAILABLE\nerror_codes: INTERNAL\nretry_strategy {\n min_delay {\n seconds: 1\n }\n max_delay {\n seconds: 5\n }\n max_retries: 2\n request_deadline_fraction: 1\n}\nretry_thresholds {\n threshold_type: PER_MODEL_RETRY_RATE\n threshold: 1\n threshold_duration {\n seconds: 10\n }\n}\nretry_thresholds {\n threshold_type: PER_MODEL_RETRY_RATE_LONG_CONTEXT\n threshold: 1\n threshold_duration {\n seconds: 10\n }\n}\nretry_threshold_fallback_behavior: RETRY_THRESHOLD_FALLBACK_BEHAVIOR_OPEN\n"
modifier {
value_operator: OVERRIDE
base_value: "config_id: \"ge_retries_config\"\nmodel_ids: \"gemini-2.0-flash-001\"\nmodel_ids: \"gemini-2.0-flash-lite-001\"\nmodel_ids: \"gemini-2.5-pro-preview\"\nmodel_ids: \"gemini-2.0-flash-preview\"\nmodel_ids: \"gemini-[2-5]\\\\..*\"\nrequest_types: \"dedicated-critical_plus\"\nrequest_types: \"shared-critical\"\nrequest_types: \"shared-sheddable_plus\"\nerror_codes: RESOURCE_EXHAUSTED\nerror_codes: UNAVAILABLE\nerror_codes: INTERNAL\nretry_strategy {\n min_delay {\n seconds: 1\n }\n max_delay {\n seconds: 5\n }\n max_retries: 2\n request_deadline_fraction: 1\n}\nretry_thresholds {\n threshold_type: PER_MODEL_RETRY_RATE\n threshold: 1\n threshold_duration {\n seconds: 10\n }\n}\nretry_thresholds {\n threshold_type: PER_MODEL_RETRY_RATE_LONG_CONTEXT\n threshold: 1\n threshold_duration {\n seconds: 10\n }\n}\nretry_threshold_fallback_behavior: RETRY_THRESHOLD_FALLBACK_BEHAVIOR_OPEN\nrequest_source: REQUEST_SOURCE_FLEX_API\n"
condition_group {
}
condition_index: 374
}
id: 0
'
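
The two base_value strings above are escaped text-format protos, so the difference between the top-level value and the modifier override is hard to see by eye. The following is a minimal Python sketch, not part of the dumped page, that undoes the escaping on shortened excerpts of the two strings and lists what the override adds. The helper names (unescape, added_lines, decode_model_pattern) are hypothetical, the excerpts are only the tail of each string, and treating model_ids entries as full-match regular expressions is an assumption, not something the dump confirms.

```python
import difflib
import re


def unescape(value: str) -> str:
    # Undo one level of C-style escaping (backslash-n, backslash-quote, ...).
    return value.encode("ascii").decode("unicode_escape")


# Tail excerpts of the two escaped base_value strings, copied from the dump.
TOP_LEVEL = r'retry_threshold_fallback_behavior: RETRY_THRESHOLD_FALLBACK_BEHAVIOR_OPEN\n'
MODIFIER = (
    r'retry_threshold_fallback_behavior: RETRY_THRESHOLD_FALLBACK_BEHAVIOR_OPEN\n'
    r'request_source: REQUEST_SOURCE_FLEX_API\n'
)

# The wildcard model_ids entry exactly as it appears inside base_value.
ESCAPED_MODEL_PATTERN = r'gemini-[2-5]\\\\..*'


def added_lines(base: str, override: str) -> list[str]:
    # Lines present in the override text but absent from the base text.
    diff = difflib.ndiff(unescape(base).splitlines(), unescape(override).splitlines())
    return [line[2:] for line in diff if line.startswith("+ ")]


def decode_model_pattern(escaped: str) -> str:
    # Two escaping layers: the outer dump string, then the inner proto string.
    return unescape(unescape(escaped))


if __name__ == "__main__":
    print(added_lines(TOP_LEVEL, MODIFIER))
    pattern = decode_model_pattern(ESCAPED_MODEL_PATTERN)
    print(pattern)
    # Assumption: patterns are applied as full-match regular expressions.
    print(bool(re.fullmatch(pattern, "gemini-2.5-pro-preview")))
```

On these excerpts the only added line is request_source: REQUEST_SOURCE_FLEX_API, which matches a side-by-side reading of the two full strings: the override repeats the base config and appends that one field.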