w3c · alvestrand · May 30, 2023 · May 30, 2023 · Jun 2, 2023 · Jun 13, 2023
diff --git a/index.bs b/index.bs
@@ -61,21 +61,108 @@ This specification shows the IDL extensions for WebRTC.
 It uses an additional API on {{RTCRtpSender}} and {{RTCRtpReceiver}} to
 insert the processing into the pipeline.
 
+## RTCPeerConnection extensions ## {#rtcpeerconnection-extensions}
+In order to correctly identify the transformed content to the other end
+of the connection, new interfaces are added on {{RTCPeerConnection}} to
+allow the declaration and negotiation of new media types.
+
+<pre class="idl">
+// New methods for RTCPeerConnection
+partial interface RTCPeerConnection {
+   undefined AddSendCodecCapability(RTCRtpCodecCapability capability);
+   undefined AddReceiveCodecCapability(RTCRtpCodecCapability capability);
+};
+</pre>
+
+In addition, add two new internal slots to the RTCPeerConnection, called
+<dfn attribute dfn-for="RTCPeerConnection">\[[CustomSendCodecs]]</dfn>
+and <dfn attribute dfn-for="RTCPeerConnection">\[[CustomReceiveCodecs]]</dfn>,
+initially empty.
+
+### Methods ### {#rtcpeerconnection-methods}
+
+#### AddSendCodecCapability #### {#add-send-codec-capability}
+
+1. Let 'capability' be the "capability" argument
+1. If 'capability' is identical to the capability describing a previously configured codec,
+    throw an IllegalModification error.
+1. Add 'capability' to the PeerConnections's {{RTCPeerConnection/[[CustomSendCodecs]]}} internal slot.
+
+#### AddReceiveCodecCapability #### {#add-receive-codec-capability}
+
+1. Let 'capability' be the "capability" argument
+1. If 'capability' is identical to the capability describing a previously configured codec,
+    throw an IllegalModification error.
+1. Add 'capability' to the PeerConnections's {{RTCPeerConnection/[[CustomReceiveCodecs]]}} internal slot.
+
+## Extensions to RTCRtpSender and RTCRtpReceiver ## {#sender-receiver-extensions}
 <pre class="idl">
+
 typedef (SFrameTransform or RTCRtpScriptTransform) RTCRtpTransform;
 
-// New methods for RTCRtpSender and RTCRtpReceiver
+// New methods and attributes for RTCRtpSender and RTCRtpReceiver
 partial interface RTCRtpSender {
     attribute RTCRtpTransform? transform;
+    undefined setPacketizer(octet payloadType, RTCRtpCodec packetizer);
 };
 
 partial interface RTCRtpReceiver {
     attribute RTCRtpTransform? transform;
+    undefined setDepacketizer(octet payloadType, RTCRtpCodec depacketizer);
+    undefined addDecodingCodec(octet payloadType, RTCRtpCodec parameters);
 };
 </pre>
 
+### New methods ### {#sender-receiver-new-methods}
+<dl>
+<dt>setPacketizer</dt>
+<dd>
+When called, indicate that the packetizer should behave as if frames
+enqueued on the sender with the indicated payload type should be packetized
+according to the rules of the codec indicated.<br>
+It is up to the transform to ensure that all data and metadata conforms
+to the format required by the packetizer; the sender may drop frames that
+do not conform. {{RTCOutboundRtpStreamStats/framesSent}} is not incremented
+when a frame is dropped.
+</dd>
+<dt>setDepacketizer</dt>
+<dd>
+When called, indicate that incoming packets with the `payloadType` value in
+their PT field should be routed to a depacketizer that will perform
+depacketization according to the rules of `depacketizer`.
+</dd>
+<dt>addDecodingCodec</dt>
+<dd>
+When called, initialize a decoder in the RTCRtpReceiver to handle the format indicated by the `parameters` argument.
+</dd>
+</dl>
+
+<p class="note">
+The expected mode of operation is that the sender transform will transform
+the frame and change its `payloadType` to the negotiated value for the
+transformed content. The receiver transform will receive the `payloadType`,
+transform it, and set the `payloadType` to the value that corresponds to the format
+of the decoded frame, thus ensuring that it is decoded using the correct
+decoder.
+</p>
+
 ## Extension operation ## {#operation}
 
+### Media type negotiation ### {#sdp-negotiation}
+The following modifications are applied to the session negotation procedure:
+
+1. The {{RTCPeerConnection/[[CustomSendCodecs]]}} is included in the list of codecs which the user agent
+    is currently capable of sending; the {{RTCPeerConnection/[[CustomReceiveCodecs]]}} is included in the list of codecs which the user agent is currently prepared to receive, as referenced in "set a session description" steps.
+
+1. In the [$final steps to create an offer$], step 2.2 is changed to filter the list of codecs against
+    the union of the [=RTCRtpSender/list of implemented send codecs=] and the {{RTCPeerConnection/[[CustomSendCodecs]]}} slot, and step 2.3 is changed to filter against the union of the [=list of implemented receive codecs=] and the {{RTCPeerConnection/[[CustomReceiveCodecs]]}} slot.
+
+1. In the [$final steps to create an answer$], the same modification is done to the filtering in steps 1.2 and 1.3.
+
+
+
+
+### Codec initialization  ### {#codec-initialization}
 At the time when a codec is initialized as part of the encoder, and the
 corresponding flag is set in the {{RTCPeerConnection}}'s {{RTCConfiguration}}
 argument, ensure that the codec is disabled and produces no output.

diff --git a/sdp-explainer.md b/sdp-explainer.md
@@ -0,0 +1,137 @@
+# SDP negotiation in Encoded Transform
+
+The original definition of encoded transform did not consider negotation; the frames on the "transformed" side went out stamped with the payload type of the frame that came in on the "non-transformed" side (and vice versa for the receiver).
+
+This creates a problem, in that when an encoded transform is applied on the sending side, the bits on the wire may not correspond to what the SDP negotiation has declared that particular type to be used for. When only talking to another instance of the same application, that is not an issue, but it is an issue as soon as we want two different applications to interoperate - for instance when exchanging SFrame encrypted media between two endpoints from different application providers, or when exchanging SFrame encrypted content via an SFU that expects to be able to decode or modify the media.
+
+(The latter is exactly what Sframe is designed to prevent, but it is better for the intermediary to fail clean than to engage in possibly random behavior due to attempting to decode a stream that does not conform to the description it expects.)
+
+This problem is even more acute when the Encoded Transform is used to add support for codecs not natively supported by the browser; without the ability to influence SDP negotiation, there is no standard way to ensure that a receiver supporting the new codec is able to associate the payload type of incoming packets with the right decoder.
+
+For example, it's been proposed to add [Lyra](https://github.com/google/lyra) to WebRTC using an implementation in WASM
+- a working example using SDP munging can be found on the
+[Meetecho blog](https://www.meetecho.com/blog/playing-with-lyra/).
+
+# Requirements for an SDP negotiation API
+The following things need to be available on such an API:
+1. Before SDP negotiation, the application must be able to specify one or more new media types that one wants to negotiate for. As a point of illustration, this document uses the type "video/new-codec".
+2. After SDP negotiation, the application must be able to identify if negotiation has succeeded or failed, and what payload type has been assigned for the new media type.
+3. Before sending frames, the application must be able to inform the RTP sender of what kind of packetization to use on the outgoing frames.
+4. Before receiving frames, the application must be able to inform the RTP receiver of what kind of depacketization to use on the incoming frames.
+5. When transforming frames, the sending application must be able to mark the transformed frames with the negotiated payload type before sending.
+6. When transforming frames, the receiving application must be able to check that the incoming frame has the negotiated payload type, and (if reenqueueing the frame after transformation) mark the transformed frame with the appropriate payload type for decoding within the RTPReceiver.
+
+# API description
+
+## Codec description
+For codec description, we reuse the dictionary RTCRtpCodecCapability.
+
+The requirements on the parameters are:
+- either mimetype or fmtp parameters must be different from any existing capability
+
+When a codec capability is added, the SDP machinery will negotiate these codecs as normal, and the resulting payload type will be visible in RTCRtp{Sender,Receiver}.getParameters().
+
+
+## For SDP negotiation
+SDP negotiation is inherently an SDP property, and the proper target for that is therefore the RTCRtpTransceiver object.
+
+The transceiver will always be available before an offer or answer is created (either because of AddTrack or AddTransceiver on the sender side,
+or in the ontrack event on the side that receives the offer). At that time, the additional codecs to negotiate must be set:
+
+```
+transceiver.addCodec(RTCRtpCodec codec, RTCRtpCodec packetizationMode)
+```
+This will add the described codec to the list of codecs to be offered in createOffer/createAnswer. On successful negotiation
+of the codec, it will appear in the negotiated codec list returned from setParameters().
+
+Furthermore, the call will instruct the sender to packetize, and the receiver to depacketize, in accordance with the rules for the
+codec being given as the packetizationMode argument.
+
+Because it is sometimes inconvenient to intercept every call that creates transceivers, a convenience method is offered on the PC level:
+
+```
+pc.addCodecCapability(DOMString kind, RTCRtpCodec codec, RTCRtpCodec packetizationMode)
+```
+These calls will act as if the addCodec() call had been invoked on every transceiver created of the associated "kind".
+
+NOTE: The codecs will not show up on the static sender/receiver getCapabilities methods, since these methods can’t distinguish between capabilities used for different PeerConnections. They will show up in the list of codecs in RTCRtp{Sender,Receiver}.getParameters(), so they’re available to the RTCRtpSender for selection or deselection.
+
+
+
+## Existing APIs that will be used together with the new APIs
+- Basic establishing of EncodedTransform
+- getParameters() to get results of codec negotiation
+- encoded frame SetMetadata, to set the payload type for processed frames
+
+# Example code
+(This is incomplete)
+```
+customCodec = {
+   mimeType: “video/x-encrypted”,
+   clockRate: 90000,
+   fmtp = “encapsulated-codec=vp8”,
+};
+
+// At sender side
+pc.addCodecCapability('video', customCodec, {mimeType: "video/vp8"});
+
+// ...after negotiation
+
+const {codecs} = sender.getParameters();
+const {payloadType} = codecs.find(({mimeType}) => mimeType == "video/acme-encrypted");
+
+const worker = new Worker(`data:text/javascript,(${work.toString()})()`);
+sender.transform = new RTCRtpScriptTransform(worker, {payloadType});
+
+function work() {
+  onrtctransform = async ({transformer: {readable, writable, options}}) =>
+    await readable.pipeThrough(new TransformStream({transform})).pipeTo(writable);
+
+  function transform(frame, controller) {
+    // transform chunk
+    let metadata = frame.metadata();
+    encryptBody(frame);
+    metadata = frame.metadata();
+    metadata.pt = options.payloadType;
+    frame.setMetadata(metadata);
+    controller.enqueue(frame);
+  }
+}
+
+// At receiver side.
+const decryptedPT = 208; // Can be negotiated PT or locally-valid
+pc.addCodecCapability('video', customCodec, {mimeType: "video/vp8"});
+pc.ontrack = ({receiver}) => {
+  const {codecs} = sender.getParameters();
+  const encryptingCodec = codecs.find(({mimeType}) => mimeType == "video/acme-encrypted");
+   receiver.setDepacketizer(encryptingCodec.payloadType, {mimeType: "video/vp8"});
+   receiver.addDecodingCodec({mimeType: 'video/vp8', payloadType: decryptedPT});
+   const worker = new Worker(`data:text/javascript,(${work.toString()})()`);
+   receiver.transform = new RTCRtpScriptTransform(worker, {payloadType});
+
+  function work() {
+   transform = new TransformStream(
+      transform: (frame, controller) => {
+         if (frame.metadata().payloadType != encryptedPT) {
+           continue;  // Ignore all frames for the wrong payload.
+         }
+         decryptBody(frame);
+         metadata = frame.metadata();
+         metadata.payloadType = decryptedPT;
+         controller.enqueue(frame);
+       }
+   });
+   onrtctransform = async({transformer: {readable, writable, options}}) =>
+     await readable.pipeThrough(transform).pipeTo(writable);
+};
+```
+
+# Frequently asked questions
+
+1.  Q: My application wants to send frames with multiple packetizers. How do I accomplish that?
+
+    A: Use multiple payload types. Each will be assigned a payload type. Mark each frame with the payload type they need to be packetized as.
+
+1.  Q: What is the relationship between this proposal and the IETF standard for SFrame?
+
+    A: This proposal is intended to make it possible to implement the IETF standard for SFrame in Javascript, with only the packetizer/depacketizer being updated to support the SFrame packetization rules. It is also intended to make it possible to perform other forms of transform, including the ones that are presently deployed in the field, while marking the SDP with a truthful statement of its content.