Skip to content

MediaPipe v0.10.22

Latest
Compare
Choose a tag to compare
@rtg0795 rtg0795 released this 17 Mar 22:42
· 60 commits to master since this release

Build changes

  • [mediapipe] standardize import of androidx_annotation_annotation
  • [mediapipe] standardize import of androidx_appcompact
  • [mediapipe] standardize import of androidx_constraint_layout
  • [mediapipe] standardize import of androidx_core
  • [mediapipe] standardize import of androidx_legacy_legacy_support_v4
  • [mediapipe] delete unused 3p android_library androidx_material
  • [mediapipe] standardize import of androidx_recyclereview
  • [mediapipe] standardize import of camerax
  • Fix llm_engine_main build for DRISHTI_DISABLE_GPU=1

Framework and core calculator improvements

  • Updating Troubleshooting with VLOG info.
  • Update tensors_to_image_calculator.cc
  • Delegate memory-mapping the model file to the resource system
  • Add static helpers to timestamp classes
  • Remove use of designated initializers in tflite_model_loader.cc
  • Add support for INT64 in VectorIntToTensorCalculator.
  • Use renamed wgpu::ImageCopy* structures.
  • [mediapipe] improve  mediapipe_java_proto_src_extractor
  • Bump MP version for release 0.10.22.
  • [mediapipe] improve maven artifact template
  • Add two_tap_fir_filter_calculator and update com_google_audio_tools revision.
  • Adds check to reject services with an empty shared_ptr
  • Adds check to ensure input tensors match model tensor size & type
  • Replace MapName with StaticMap in places where it's not important to use MapName
  • Make DelayedReleaser an "attachement" of the GlContext instance.
  • Utility functions that create RGB images for testing.
  • Avoids the sharing of GL contexts between nested mediapipe graphs.
  • Adds output stream stats to GraphRuntimeInfo
  • Move ImageFrames while splitting a vector of ImageFrames.
  • Add input stream to control zoom factor used in content_zooming_calculator.
  • Introduces GPU synchronization when accessing GetOpenGlBufferReadViews from a different OpenGL context than was used for the GetOpenGlBufferWriteView.
  • Adds documentation about graph runtime monitoring.
  • Use wgpu::ShaderSourceWGSL instead of wgpu::ShaderModuleWGSLDescriptor.
  • [mediapipe] restore mediapipe_aar.bzl
  • Add CreateWgslShader utility.
  • Update resource loading in WebGpuShaderCalculator to latest API.
  • Added prompt templates for session in C API

MediaPipe Tasks update

This section should highlight the changes that are done specifically for any platform and don't propagate to
other platforms.

Android

  • [mediapipe] clean up an unused target ":llm" in core
  • [mediapipe] correct the protobuf_lite dependency
  • [mediapipe] move llm jni from "core" to "genai"
  • [mediapipe] move llm proto from "core" to "genai"
  • [mediapipe] build genai tasks with exact dependencies
  • [mediapipe] create genai's specifc ProgressListener and ErrorHandler
  • [mediapipe] build vision and image_generator tasks with exact dependencies
  • Don't use MediaPipeException in JNI layer
  • Make generateResponseAsync() return a ListenableFuture and add ProgressCallback to its arguments
  • Update JNI to enable litert CPU backend for LLM inference.
  • Delete engine when task is closed.

iOS

  • Add sequenceBatchSize option when setting up the inference engine..

Javascript

  • Fix DrawingUtils constructor failing in Web Workers
  • Change starting LoraModel ids from 0 to 1.
  • Add a function to determine what type of model (handwritten, converted) a file is
  • Fix tee not cancelling the parent stream when both children are cancelled
  • Distinguish between '.bin' and '.task' in createFrom*
  • Move streamToUint8Array from task runner lib to model loading utility lib, so the graph runner extensions would be able to utilize it.

MediaPipe Dependencies

  • Update WASM files for 0.10.21-rc.20250303 release
  • Update WASM files for 0.10.22 release