diff --git a/index.html b/index.html
index 240cb75..1b767d0 100644
--- a/index.html
+++ b/index.html
@@ -355,7 +355,7 @@
We use a pretrained CLIP model to predict the
- symbolic attribute that most-likely describes the entity. Notice that the entity labelled here (C_shape = ⚫️
)
+ symbolic attribute that most-likely describes the entity. Notice that the attribute labelled here (C_shape = ⚫️
)
ignores all other attributes of the entity.
The symbolic labelling process is repeated for each entity in the scene. The resultant symbolic
- vectors are concatenated to form a symbolic embedding of the scene.
+ vectors are stacked to form a symbolic embedding of the scene.