← Back

🎬 Scene Viewer

Video ID: article-1760308789004-628018

📹 Video Playback

Total Scenes
75
Detection Threshold
0.3
Detected
Invalid Date
Scene 1 (17.935s)
0:00 → 0:17
Frame 1
Frame 1 • 0:01
Frame 2
Frame 2 • 0:08
Frame 3
Frame 3 • 0:16
🎬 Visual: The scene begins with a woman in a studio setting, wearing a dark top and necklace against a purple backdrop, introducing the segment. In the middle frame, she interacts with a man and adjusts a wearable device, likely smart glasses. The scene transitions to a conference stage where a live demo is shown, featuring an augmented reality display from the perspective of the presenter as he interacts with an audience.
🎙️ Dialogue:

I'm looking at the teleprompter, and I'm also looking at Apple Music, and we're listening to Elton John. Oh, sorry. Thank you. That was louder than I expected. Sorry. I'm sorry. I don't know what I'm doing. I'm just remote-controlling your glasses. This is a strange time in consumer gadgetry.

Duration: 17.9s
Scene 2 (3.17s)
0:17 → 0:21
Frame 1
Frame 1 • 0:18
Frame 2
Frame 2 • 0:19
Frame 3
Frame 3 • 0:20
🎬 Visual: In this scene, a person wearing a VR headset is seated in a living room and views a virtual display showing a child playing a drum set. The key subjects are the headset user and the child on the drums, with the home setting visible in the background. Across the frames, the virtual display remains prominent and the child’s drumming continues, highlighting the ongoing remote interaction through the VR device.
🎙️ Dialogue:

because tech companies are pushing us toward their.

Duration: 3.2s
Scene 3 (1.376s)
0:21 → 0:22
Frame 1
Frame 1 • 0:21
Frame 2
Frame 2 • 0:21
Frame 3
Frame 3 • 0:22
🎬 Visual: A person wearing a beige sweater sits on a teal sofa in a well-lit living room, equipped with a white and black Apple headset. Throughout the sequence, the individual remains seated with minimal movement, while the environment—a room featuring a large pillow, a window with outside brick wall visible, and simple decor—stays unchanged. The only notable change across the frames is a slight adjustment in the person's posture.
🎙️ Dialogue:

vision of a new world.

Duration: 1.4s
Scene 4 (12.721s)
0:22 → 0:35
Frame 1
Frame 1 • 0:23
Frame 2
Frame 2 • 0:28
Frame 3
Frame 3 • 0:33
🎬 Visual: The scene begins with a presenter seated at a table, gesturing while introducing a pair of smart glasses placed in front of her. She then picks up the glasses, demonstrating or discussing their features. The scene transitions to an interview setting where two people are seated and having a conversation, with several pairs of smart glasses displayed on the table between them, indicating a discussion or evaluation of these devices.
🎙️ Dialogue:

world, one where you can just have AI information appear in front of your eyes with smart glasses, like these Ray-Ban displays from Meta. If you meet someone who wears these, you might not be able to...

Duration: 12.7s
Scene 5 (14.014s)
0:35 → 0:49
Frame 1
Frame 1 • 0:36
Frame 2
Frame 2 • 0:42
Frame 3
Frame 3 • 0:47
🎬 Visual: In this scene, the video transitions from an interview or discussion setting featuring a seated individual, to a live demonstration of an augmented reality interface showing app icons overlaid on a person's view, and finally to a segment highlighting a participant trying on Android XR glasses in an event space. Key actions include conversation, a technology demo, and hands-on interaction with extended reality devices. The context shifts from a studio environment to a tech showcase, indicating progression from explanation to practical demonstration.
Scene 6 (3.42s)
0:49 → 0:52
Frame 1
Frame 1 • 0:49
Frame 2
Frame 2 • 0:50
Frame 3
Frame 3 • 0:52
🎬 Visual: The scene displays a Bloomberg news article about Apple shelving its Vision Headset revamp to focus on Meta-like AI glasses. The main subject is a screenshot of the article, featuring a headline, a photo of the Apple Vision Pro headset on display in a store, and article details. Across the three frames, the article remains on screen with no notable visual changes, indicating a static segment providing context or information.
🎙️ Dialogue:

And rumors say Apple is working on making its own pair.

Duration: 3.4s
Scene 7 (5.547s)
0:52 → 0:58
Frame 1
Frame 1 • 0:53
Frame 2
Frame 2 • 0:55
Frame 3
Frame 3 • 0:57
🎬 Visual: The scene begins with a presenter seated at a table, gesturing and referencing a pair of smart glasses placed in front of her against a purple studio background. In the final frame, the scene shifts to a large auditorium setting where a speaker is addressing an audience, with a large screen behind him displaying visual aids and another individual. The progression shows a transition from a studio demonstration to a live presentation, highlighting the smart glasses and their potential public use.
🎙️ Dialogue:

If Apple has the chance to do something different, what can Apple bring to the space to make it feel...

Duration: 5.5s
Scene 8 (6.465s)
0:58 → 1:04
Frame 1
Frame 1 • 0:58
Frame 2
Frame 2 • 1:01
Frame 3
Frame 3 • 1:04
🎬 Visual: Scene 8 begins with a POV shot from a live demo event, where the key subject stands on stage near a large object. The middle frame shifts to a woman seated at a studio desk with smart glasses placed in front of her, indicating a product demonstration or review. In the final frame, a different person presents an HTC smartphone displaying a video, highlighting a change in technology context and possibly referencing past innovations (2013).
🎙️ Dialogue:

little more normal. For this week's Apple show I'm talking to CNET editor at large Scott Stein who is

Duration: 6.5s
Scene 9 (2.085s)
1:04 → 1:06
Frame 1
Frame 1 • 1:04
Frame 2
Frame 2 • 1:05
Frame 3
Frame 3 • 1:06
🎬 Visual: In these frames from Scene 9, a person is shown outdoors wearing smart glasses, with the CNET logo and the year "2019" visible on screen. The individual raises their hand and interacts by tapping the side of the smart glasses, suggesting activation or use of a feature. Between the first and last frame, the hand moves from the ear to the glasses, indicating a change in action or interaction with the device.
Scene 10 (1.251s)
1:06 → 1:07
Frame 1
Frame 1 • 1:06
Frame 2
Frame 2 • 1:07
Frame 3
Frame 3 • 1:07
🎬 Visual: In Scene 10, an individual stands or moves slightly on a residential sidewalk, with houses and a well-kept lawn visible under bright daylight. The frames show a consistent outdoor suburban setting, with minor changes in the camera angle and the positioning of the person's head. Overall, the key event is the individual being present on the sidewalk during daytime, possibly engaging in a spoken presentation or demonstration.
🎙️ Dialogue:

After covering the

Duration: 1.3s
Scene 11 (1.585s)
1:07 → 1:09
Frame 1
Frame 1 • 1:08
Frame 2
Frame 2 • 1:08
Frame 3
Frame 3 • 1:09
🎬 Visual: In this scene, a person wearing a buttoned shirt and a head-mounted device is shown standing indoors in front of a window during daylight. The subject begins with a hand gesture and remains mostly stationary as the camera gradually zooms out to reveal more of the surroundings. The environment is modern and well-lit, and the focus stays consistent on the wearable technology throughout the sequence.
🎙️ Dialogue:

space for over a decade.

Duration: 1.6s
Scene 12 (13.639s)
1:09 → 1:23
Frame 1
Frame 1 • 1:10
Frame 2
Frame 2 • 1:16
Frame 3
Frame 3 • 1:21
🎬 Visual: In this scene, a presenter introduces and discusses smart glasses on a studio set, starting with a physical pair of glasses on a table. Midway, an on-screen graphic highlights the topic: "A Vision for Apple Smart Glasses." The scene ends with a close-up online listing for Meta Ray-Ban Display glasses, indicating a shift to discussing product features and comparisons.
🎙️ Dialogue:

because I don't know if this is going to be my vision in the next decade. I'm Bridget Carey and this is One More Thing.

Duration: 13.6s
Scene 13 (4.504s)
1:23 → 1:27
Frame 1
Frame 1 • 1:23
Frame 2
Frame 2 • 1:25
Frame 3
Frame 3 • 1:27
🎬 Visual: In this scene, a presenter stands on stage during a live demo event, accompanied by on-screen subtitles explaining the features being discussed. The first two frames show the presenter gesturing near a white podium in a blue-lit auditorium, while the final frame transitions to a website interface for scheduling a demo of Meta Ray-Ban Display and Meta Neural Band devices. The scene shifts focus from the live demonstration to providing viewers information about where to experience the showcased technology.
Scene 14 (20.645s)
1:27 → 1:48
Frame 1
Frame 1 • 1:29
Frame 2
Frame 2 • 1:38
Frame 3
Frame 3 • 1:46
🎬 Visual: In this scene, the video begins with a screen showing the process of trying to book a Meta Ray-Ban Display Demo appointment, highlighting that no appointment slots are available. It then transitions to a presenter in a studio setting delivering commentary or explanation. The scene ends with a Bloomberg news article displayed on screen, reporting that Apple has paused updates to its Vision Headset to focus on developing AI glasses similar to Meta’s product, marking a shift in industry priorities.
🎙️ Dialogue:

It's hard to actually find a store to sell you a pair to take home unless you wait weeks for a fitting appointment. So I don't want to make it seem like the new display glasses are really popular, but it is something that tech enthusiasts are following closely to see what direction we are heading in this technology. Meanwhile, a report from Bloomberg had a lot of news outlets buzzing.

Duration: 20.6s
Scene 15 (2.002s)
1:48 → 1:50
Frame 1
Frame 1 • 1:48
Frame 2
Frame 2 • 1:49
Frame 3
Frame 3 • 1:50
🎬 Visual: In this scene, a person wearing a futuristic headset is seated in front of a dark backdrop illuminated by vertical blue light strips. Over the three frames, the individual slightly shifts their head position from side to center and back, indicating minor movements while remaining focused. The environment and headset suggest a tech demonstration or testing scenario, with no significant changes to the setting or objects between frames.
🎙️ Dialogue:

that Apple is shifting some resources.

Duration: 2.0s
Scene 16 (1.752s)
1:50 → 1:52
Frame 1
Frame 1 • 1:50
Frame 2
Frame 2 • 1:51
Frame 3
Frame 3 • 1:51
🎬 Visual: In this scene, a person is putting on a black VR headset, adjusting it against their head with one hand. The sequence shows the headset being initially positioned, then secured more firmly in the middle frame, and finally sitting completely in place by the end. The setting appears to be an indoor environment, possibly a studio or review space (as indicated by the CNET logo).
🎙️ Dialogue:

away from the Vision Pro.

Duration: 1.8s
Scene 17 (4.379s)
1:52 → 1:56
Frame 1
Frame 1 • 1:52
Frame 2
Frame 2 • 1:54
Frame 3
Frame 3 • 1:56
🎬 Visual: A woman with long blonde hair, wearing a navy dress and silver necklace, is presenting at a wooden table against a purple studio background with the CNET logo visible. Throughout the scene, she uses expressive hand gestures that start with her hands positioned near each other, move to a more neutral stance, and end with her arms spread wider. The shift in hand positions suggests she is emphasizing or elaborating on key points during her presentation.
Scene 18 (3.212s)
1:56 → 1:59
Frame 1
Frame 1 • 1:56
Frame 2
Frame 2 • 1:58
Frame 3
Frame 3 • 1:59
🎬 Visual: In this scene, a pair of smart glasses and a virtual reality headset are displayed on a wooden table next to a silver laptop, set against a blurred indoor background. The camera remains focused on these devices throughout, gradually zooming out to reveal more of the tabletop and surrounding environment. There are no changes in the arrangement of the objects, but the wider framing at the end provides additional context for the setting.
🎙️ Dialogue:

to compete with what Meta is doing and

Duration: 3.2s
Scene 19 (22.439s)
1:59 → 2:22
Frame 1
Frame 1 • 2:01
Frame 2
Frame 2 • 2:10
Frame 3
Frame 3 • 2:19
🎬 Visual: In Scene 19, a presenter seated at a desk in a studio with a purple backdrop demonstrates and discusses a technology product, using expressive hand gestures to explain its features. The scene transitions to a close-up shot through a pair of smart glasses, revealing a digital interface with floating icons overlaying the real-world view. The main change between frames is the shift from the presenter’s explanation to a direct visual demonstration of the augmented reality display, highlighting the product’s functionality.
🎙️ Dialogue:

this being an Apple show, we have to see what Apple could bring to this space. Okay, thank you for joining me, Scott. I know you've been testing these for a while. We've been talking while you're testing. Are you seeing me or the display? I'm seeing you. Why? What does it look like? No, it looks normal. Just looks like regular glasses, right? But if I do that, I can bring up the display. So now there's a display that's floating in this eye.

Duration: 22.4s
Scene 20 (1.376s)
2:22 → 2:23
Frame 1
Frame 1 • 2:22
Frame 2
Frame 2 • 2:22
Frame 3
Frame 3 • 2:23
🎬 Visual: In this scene, a woman with blonde hair, dressed in a navy blue top and necklace, is seated against a purple studio backdrop with the CNET logo visible in the corner. She appears to shift her posture slightly between frames, possibly turning to engage with another person who enters the right side of the final frame. The overall context suggests a studio interview or discussion setting, with the main change being the introduction of a second participant by the scene's end.
🎙️ Dialogue:

I can't tell.

Duration: 1.4s
Scene 21 (6.09s)
2:23 → 2:29
Frame 1
Frame 1 • 2:24
Frame 2
Frame 2 • 2:26
Frame 3
Frame 3 • 2:28
🎬 Visual: In this scene, a person demonstrates putting on and interacting with a pair of smart glasses, with the brand "Meta Ray-Ban Wayfarer" identified in the final frame. The first frame shows the initial gesture of donning the glasses, the second frame involves discussion or explanation with another individual present, and the final frame provides a close-up of the person touching the glasses in an outdoor setting. The scene transitions from an indoor studio with a purple background to an outdoor environment, highlighting both the product and user interaction.
🎙️ Dialogue:

But if I, like, record, like, that shows it's recording. Okay, yeah. But that's like before.

Duration: 6.1s
Scene 22 (3.128s)
2:29 → 2:32
Frame 1
Frame 1 • 2:29
Frame 2
Frame 2 • 2:31
Frame 3
Frame 3 • 2:32
🎬 Visual: The scene displays a close-up view of a black wearable band, identified as the "Meta Neural Band," resting on a wooden surface next to a black object. The middle frame introduces an overlay text labeling the device, while the other frames retain the same setting and composition without label. The main change across the frames is the appearance and disappearance of the "META NEURAL BAND" text overlay, confirming the subject's identity.
Scene 23 (14.264s)
2:32 → 2:47
Frame 1
Frame 1 • 2:34
Frame 2
Frame 2 • 2:39
Frame 3
Frame 3 • 2:45
🎬 Visual: In this scene, two presenters stand against a purple backdrop and engage in conversation, with one gesturing while speaking at first. The middle frame shows both subjects interacting, likely discussing a topic. By the final frame, the focus shifts to the second presenter, indicating a transition in speaking roles or emphasis.
🎙️ Dialogue:

You've had a lot of fun playing with this, but why would Apple want to invest in getting into this space and doing something like these display smart glasses? Part of it may be cost versus effort for something like this.

Duration: 14.3s
Scene 24 (2.92s)
2:47 → 2:49
Frame 1
Frame 1 • 2:47
Frame 2
Frame 2 • 2:48
Frame 3
Frame 3 • 2:49
🎬 Visual: In this scene, a close-up of the Apple Vision Pro headset is displayed on a dark surface with a black backdrop. At the start, only the headset is visible; in the middle and final frames, an on-screen label appears identifying the product as "Apple Vision Pro" and listing its price as $3,500. The scene setting remains consistent throughout, with the only notable change being the addition of the product name and pricing information.
🎙️ Dialogue:

because the Vision Pro, it's not like they haven't worked.

Duration: 2.9s
Scene 25 (3.086s)
2:49 → 2:53
Frame 1
Frame 1 • 2:50
Frame 2
Frame 2 • 2:51
Frame 3
Frame 3 • 2:52
🎬 Visual: In this scene, a subject seated indoors interacts with an Apple Vision Pro headset, as indicated by the text overlay and visible device. At first, the subject sits passively, and then in the following frames, raises a hand to touch or adjust the headset. The setting stays consistent throughout, with blue illuminated lines in the background and the CNET logo present, while the main action involves the subject engaging with the device.
🎙️ Dialogue:

in AR and VR already. They already have this bleeding edge.

Duration: 3.1s
Scene 26 (1.543s)
2:53 → 2:54
Frame 1
Frame 1 • 2:53
Frame 2
Frame 2 • 2:53
Frame 3
Frame 3 • 2:54
🎬 Visual: In this scene, a finger presses the button on the side of a metallic, mesh-textured device against a blurred blue and brown background. The main action shows the finger approaching, pressing, and then releasing the button, with the device remaining the focal object throughout. The notable change across the frames is the button's depression and release, indicating interaction or activation.
Scene 27 (0.835s)
2:54 → 2:55
Frame 1
Frame 1 • 2:54
Frame 2
Frame 2 • 2:54
Frame 3
Frame 3 • 2:55
🎬 Visual: In this scene, a close-up sequence shows a man wearing a VR headset, with the camera gradually zooming out or shifting to reveal more of his face and the headset. The main subject is the headset itself, which is prominently displayed against a dark background, suggesting a tech review or demonstration setting. Across the three frames, the man's expression remains neutral, and the scene focuses on highlighting the design and fit of the device, with minimal change except for the camera's perspective.
🎙️ Dialogue:

had said.

Duration: 0.8s
Scene 28 (6.006s)
2:55 → 3:01
Frame 1
Frame 1 • 2:55
Frame 2
Frame 2 • 2:58
Frame 3
Frame 3 • 3:00
🎬 Visual: Two people are engaged in a discussion on camera, seated against a solid purple backdrop with "CNET" branding visible on one participant's shirt. The main action consists of the man gesturing with his hand as he speaks, while the woman listens attentively. Throughout the frames, the interaction shifts from mutual attention to active explanation, indicating a conversational exchange in a studio setting.
🎙️ Dialogue:

What are they going to sell to people that doesn't cost $3,500? And I think glasses are the more attainable goal.

Duration: 6.0s
Scene 29 (27.902s)
3:01 → 3:29
Frame 1
Frame 1 • 3:04
Frame 2
Frame 2 • 3:15
Frame 3
Frame 3 • 3:26
🎬 Visual: This scene begins with a stylized visual featuring concentric colorful rings, suggesting a transition or introduction. It then shifts to a studio shot of a person in front of a purple background, providing commentary or context. The scene concludes with a close-up of a hand holding a smartphone and using its camera, set in a busy showroom environment with other people in the background, indicating a product demonstration or hands-on review.
🎙️ Dialogue:

I mean, Apple has camera tech, they have AirPods and audio tech, and they also work in wearables like watches, and they have the ARVR experience. If they're going to become a company that's a big player in AI, which right now they're behind in, that's an area that the glasses could be involved with. Yeah, I keep going through what could Apple do differently and what advantages does it have. Obviously, Meta does not have a phone or a smartwatch.

Duration: 27.9s
Scene 30 (1.418s)
3:29 → 3:30
Frame 1
Frame 1 • 3:29
Frame 2
Frame 2 • 3:30
Frame 3
Frame 3 • 3:30
🎬 Visual: In this scene, a person is fitting a wearable device, labeled as the "Meta Neural Band," onto another person's wrist in a casual indoor setting. The action progresses from the initial attachment of the band to securing it around the wrist. Notably, text identifying the device appears in the second frame, and the positioning of the hands and band adjusts slightly as the task is completed.
Scene 31 (2.962s)
3:30 → 3:33
Frame 1
Frame 1 • 3:31
Frame 2
Frame 2 • 3:32
Frame 3
Frame 3 • 3:33
🎬 Visual: In this scene, a seated person wearing a Meta Neural Band and a smartwatch is shown performing a hand gesture with a closed fist. The setting appears to be a casual, indoor space with patterned walls and flooring. Across the three frames, the person's hand remains in a similar closed fist position, and the only notable change is the disappearance of the "META NEURAL BAND" label from the beginning to the end of the sequence.
🎙️ Dialogue:

neural band to do gesture controls, but...

Duration: 3.0s
Scene 32 (3.17s)
3:33 → 3:36
Frame 1
Frame 1 • 3:33
Frame 2
Frame 2 • 3:35
Frame 3
Frame 3 • 3:36
🎬 Visual: In this scene, a close-up shot of an Apple Watch (Series 11) is shown outdoors against a blurred green background. The beginning frame displays the watch face, followed by the middle frame where a label "APPLE WATCH SERIES 11" appears at the top, identifying the device. By the final frame, the label disappears, with the focus remaining on the watch and no major changes to the watch display itself.
🎙️ Dialogue:

apple has an apple watch with gesture controls so like where

Duration: 3.2s
Scene 33 (10.218s)
3:36 → 3:47
Frame 1
Frame 1 • 3:37
Frame 2
Frame 2 • 3:41
Frame 3
Frame 3 • 3:46
🎬 Visual: In Scene 33, a woman is initially shown speaking and gesturing with her hands against a purple studio background. As the scene progresses, she continues talking before a man joins her in the final frame, and they engage in a discussion, both using expressive hand gestures. The setting remains consistent throughout, with the key change being the transition from a single speaker to an interactive conversation between the two subjects.
🎙️ Dialogue:

could Apple maybe take it to a point where it might solve some of the hiccups that you find as a gadget reviewer? Yeah, I mean some of it may seem redundant if you think about, oh I'm

Duration: 10.2s
Scene 34 (3.128s)
3:47 → 3:50
Frame 1
Frame 1 • 3:47
Frame 2
Frame 2 • 3:48
Frame 3
Frame 3 • 3:49
🎬 Visual: In Scene 34, a pair of white wireless earbuds in their charging case and a smart watch are displayed on a light-colored surface in close-up. The sequence begins with the camera focused on the earbuds, then shifts focus to highlight the watch behind them, with both items remaining stationary throughout. The scene context suggests a product showcase, and the main change between frames is the transition of camera focus from the earbuds to the watch.
Scene 35 (7.8s)
3:50 → 3:57
Frame 1
Frame 1 • 3:50
Frame 2
Frame 2 • 3:54
Frame 3
Frame 3 • 3:57
🎬 Visual: The scene takes place in a studio setting with a purple backdrop and CNET branding, featuring two people engaged in a conversation. The first frame shows a man gesturing with his hand, while the second and third frames reveal that he is joined by a woman, and both are actively communicating with expressive hand movements. The transition from a single subject to an interactive exchange between the two indicates a shift from monologue to dialogue during the scene.
🎙️ Dialogue:

things do I need to be wearing at the same time? But sometimes that happens. Sometimes one product catabolizes another product that's out there.

Duration: 7.8s
Scene 36 (3.128s)
3:57 → 4:01
Frame 1
Frame 1 • 3:58
Frame 2
Frame 2 • 3:59
Frame 3
Frame 3 • 4:00
🎬 Visual: In this scene, a person holds up an Apple Watch SE 3 in a well-lit, indoor setting. The watch is first shown from the side, then rotated to reveal the front display with the time, while an on-screen label identifies the model. The main change between frames is the watch’s orientation, shifting from a side view to a full front view highlighting the watch face and design.
🎙️ Dialogue:

but they could use the watch to control the glasses.

Duration: 3.1s
Scene 37 (2.169s)
4:01 → 4:03
Frame 1
Frame 1 • 4:01
Frame 2
Frame 2 • 4:02
Frame 3
Frame 3 • 4:03
🎬 Visual: Two people are standing in front of a purple backdrop, engaged in a discussion. The man demonstrates something with his hands during the scene, starting with a closed gesture and ending with open hands, indicating a possible explanation or demonstration. The CNET logo is present, suggesting a technology-focused context.
🎙️ Dialogue:

The neural band is the beginning of that.

Duration: 2.2s
Scene 38 (12.012s)
4:03 → 4:15
Frame 1
Frame 1 • 4:04
Frame 2
Frame 2 • 4:09
Frame 3
Frame 3 • 4:14
🎬 Visual: The scene begins with a close-up shot of a Meta Ray-Ban smart glasses and a wearable device displayed on a wooden tray. It then transitions to two people in conversation in a studio setting with a purple background, both wearing the wristband and one wearing glasses. By the final frame, the focus shifts to a close-up of one individual as the discussion continues, emphasizing the wearable technology being presented.
🎙️ Dialogue:

wrist-to-glasses relationship, and then a lot of things that are holding back smart glasses like early smart watches are the connection with phones. You know, the Ray-Ban displays only run a handful of apps that are metas.

Duration: 12.0s
Scene 39 (2.002s)
4:15 → 4:17
Frame 1
Frame 1 • 4:15
Frame 2
Frame 2 • 4:16
Frame 3
Frame 3 • 4:17
🎬 Visual: In this scene, a woman stands in front of a bookshelf and examines a record or book with an image of a glass and spilled liquid on its cover. She initially appears to be reading or inspecting the item, briefly gestures with her hand, and then holds the item steady while looking at it. The setting remains consistent, with a "NO PASAR" sign and decorative objects in the background, and the woman's posture transitions from a side view to facing forward.
Scene 40 (4.88s)
4:17 → 4:22
Frame 1
Frame 1 • 4:17
Frame 2
Frame 2 • 4:19
Frame 3
Frame 3 • 4:21
🎬 Visual: In this scene, a person holds up a vinyl record cover titled "I've Tried Everything But Therapy" against the backdrop of a live audience at a TED event. The frames show the record being presented, with its text and imagery visible, while subtitles appear in both Hindi and English at the bottom. Between the frames, the lighting and subtitle language shift slightly, highlighting the dynamic context of an on-stage demonstration or presentation.
🎙️ Dialogue:

which should have glasses that have a deeper understanding of your phone.

Duration: 4.9s
Scene 41 (17.892s)
4:22 → 4:40
Frame 1
Frame 1 • 4:23
Frame 2
Frame 2 • 4:31
Frame 3
Frame 3 • 4:38
🎬 Visual: In this scene, two presenters stand and discuss a topic in front of a purple backdrop, using expressive hand gestures to emphasize their points. By the end of the sequence, the focus shifts to a close-up of a VR headset placed on a table. The notable change between the frames is the transition from an interactive discussion to a product showcase.
🎙️ Dialogue:

and if Apple does that they could triangulate and do that. I say could because the Vision Pro still doesn't connect with an iPhone and it doesn't work with the Apple Watch so you know sometimes you think it's going to happen and then it still takes a while. All right we also talked about Vision Pro. It's not a product that's dead but obviously it's not a product that's dead.

Duration: 17.9s
Scene 42 (3.796s)
4:40 → 4:43
Frame 1
Frame 1 • 4:40
Frame 2
Frame 2 • 4:41
Frame 3
Frame 3 • 4:43
🎬 Visual: In Scene 42 (280.1s - 283.9s), the video closely examines a person wearing a white, high-tech headset, likely a VR or AR device, with emphasis on its design features like the headband, adjustment knob, and front visor. The sequence progresses from a side view focused on the ear and headband, to a mid-frame highlighting the visor and its reflective surface, and finally to a full close-up of the visor over the user's face. The setting appears to be indoors, and the key change between frames is the shifting camera angle that increasingly emphasizes the headset's front, showcasing its build and user interaction.
🎙️ Dialogue:

Obviously, it's not very desirable right now, it's not very popular, and it's very expensive.

Duration: 3.8s
Scene 43 (4.588s)
4:43 → 4:48
Frame 1
Frame 1 • 4:44
Frame 2
Frame 2 • 4:46
Frame 3
Frame 3 • 4:47
🎬 Visual: In this scene, two people are having a discussion while standing in front of a purple studio backdrop branded with the CNET logo. The woman on the left starts by gesturing expressively with her hands, then lowers her hands and maintains a conversational posture as the scene progresses, while the man on the right remains attentive throughout. The frames show a shift from active explanation to a more relaxed conclusion of the interaction.
Scene 44 (3.045s)
4:48 → 4:51
Frame 1
Frame 1 • 4:48
Frame 2
Frame 2 • 4:49
Frame 3
Frame 3 • 4:51
🎬 Visual: The scene features a close-up of a pair of Meta Ray-Ban Wayfarer smart glasses and a VR headset on a wooden table, with a portable device nearby. At the start, a colored progress bar appears, which is soon replaced by an overlay displaying the product's name and price ("META RAY-BAN WAYFARER $459"). The overlay remains visible at the end, highlighting the key subject and providing product information in a tech review context.
🎙️ Dialogue:

display glasses can avoid some

Duration: 3.0s
Scene 45 (2.544s)
4:51 → 4:54
Frame 1
Frame 1 • 4:51
Frame 2
Frame 2 • 4:52
Frame 3
Frame 3 • 4:53
🎬 Visual: In Scene 45, a VR headset is shown resting on a glossy surface against a blue, studio-lit background, with the CNET logo visible in the corner. The three frames capture a static product shot, with no significant movement or change in the position of the headset or the setting throughout the sequence. The focus remains on the headset as the key subject, highlighting its design and features for the video review or analysis.
🎙️ Dialogue:

of the struggles that a vision pro had.

Duration: 2.5s
Scene 46 (8.508s)
4:54 → 5:02
Frame 1
Frame 1 • 4:54
Frame 2
Frame 2 • 4:58
Frame 3
Frame 3 • 5:01
🎬 Visual: In Scene 46, the segment begins with a person speaking in front of a purple studio background, then transitions to a screen capture of the Apple website displaying the Apple Vision Pro product page. The scene highlights details about purchasing and personalizing the Apple Vision Pro, including measuring for fit, while the product image rotates between frames. The main shift through the frames is from an on-camera presenter to a digital walkthrough of the purchase process for the Apple Vision Pro headset.
🎙️ Dialogue:

I think one is definitely price. $3,500 was always so far beyond anything that I would normally spend money on. So that's a huge problem.

Duration: 8.5s
Scene 47 (13.055s)
5:02 → 5:15
Frame 1
Frame 1 • 5:03
Frame 2
Frame 2 • 5:09
Frame 3
Frame 3 • 5:14
🎬 Visual: In this scene, a man is first shown alone in close-up against a purple-lit studio background. In the following frames, a woman joins him and they appear engaged in a conversation, gesturing with their hands. The scene transitions from a solo shot to a two-person interaction, with both subjects standing in the studio and discussing a topic on camera.
Scene 48 (3.462s)
5:15 → 5:19
Frame 1
Frame 1 • 5:15
Frame 2
Frame 2 • 5:17
Frame 3
Frame 3 • 5:18
🎬 Visual: Scene 48 shows a close-up comparison of two wearable devices, a pair of smart glasses and a VR headset, both placed side-by-side on a wooden shelf against a softly lit background. Over the three frames, the camera gradually shifts focus from the glasses (left) to the VR headset (right), with the objects remaining stationary throughout. The transition highlights the design differences between the devices, with branding from CNET visible in the lower left corner.
🎙️ Dialogue:

but also they really work out prescription stuff so that it can work with a lot.

Duration: 3.5s
Scene 49 (22.814s)
5:19 → 5:41
Frame 1
Frame 1 • 5:21
Frame 2
Frame 2 • 5:30
Frame 3
Frame 3 • 5:39
🎬 Visual: In this scene, two individuals participate in a discussion or interview within a studio setting, as indicated by the purple backdrop and CNET branding. The sequence begins with one person speaking, shifts to the second person who appears to be responding or explaining, and concludes with both individuals engaged in an active conversation facing each other. Throughout the scene, the interaction transitions from individual commentary to direct exchange between the subjects.
🎙️ Dialogue:

people's eyes which is still a challenge with a lot of smart glasses. Maybe give it extra finesse. Maybe have like zoom modes on the camera or other cool things you could like use the Apple Watch as a viewfinder or who knows what. Okay that brings me to this thought about is this the future of computing or are we on a side quest right now? Yes yes and um we're sort of on a side quest you're right.

Duration: 22.8s
Scene 50 (1.21s)
5:41 → 5:43
Frame 1
Frame 1 • 5:42
Frame 2
Frame 2 • 5:42
Frame 3
Frame 3 • 5:42
🎬 Visual: In this scene, a person is seated at a wooden table in front of a brick wall, wearing a white head-mounted device and typing on a wireless keyboard. Throughout the frames, the individual maintains their typing activity while product labels and branding ("APPLE VISION PRO" and "CNET") appear onscreen. The setting and objects (keyboard, trackpad, wireless earbuds case, and external device) remain consistent, with the notable change being the appearance of the "APPLE VISION PRO" label in the middle and end frames to indicate the featured product.
🎙️ Dialogue:

all the vr things

Duration: 1.2s
Scene 51 (3.503s)
5:43 → 5:46
Frame 1
Frame 1 • 5:43
Frame 2
Frame 2 • 5:44
Frame 3
Frame 3 • 5:46
🎬 Visual: In this scene, a virtual text document titled "Apple Vision Pro Review Draft" is displayed in a home setting, with the subject interacting with it and progressively typing the sentence "I am sitting here" and then "I am sitting here durin". The lighting equipment and camera setup indicate that the scene is being recorded or documented, and the workspace remains consistent throughout. The key change across the frames is the advancement in the typed text on the virtual screen, demonstrating input and interaction with the Apple Vision Pro device.
Scene 52 (3.087s)
5:46 → 5:49
Frame 1
Frame 1 • 5:46
Frame 2
Frame 2 • 5:48
Frame 3
Frame 3 • 5:49
🎬 Visual: In this scene, a person sitting at a table interacts with augmented reality objects—visible on a smartphone screen—while wearing AR spectacles. The subject uses hand gestures above the table to engage with the colorful AR blocks, as highlighted by the on-screen label "SNAP AR SPECTACLES." Across the frames, the subject's hands move from resting on the table to being raised, demonstrating ongoing interaction with the virtual objects.
🎙️ Dialogue:

A lot of the missing part is the blending.

Duration: 3.1s
Scene 53 (32.157s)
5:49 → 6:21
Frame 1
Frame 1 • 5:52
Frame 2
Frame 2 • 6:05
Frame 3
Frame 3 • 6:18
🎬 Visual: The scene features two people standing and conversing in a studio setting with a purple background. At the start and end, both individuals are visible engaging with each other, while the mid-frame focuses on the woman, suggesting she is speaking or explaining something. The CNET logo in the corner and the consistent background indicate this is part of a studio-produced segment, with the conversation or discussion transitioning between both subjects.
🎙️ Dialogue:

So I feel like this is the side quest to figure out like contextual AI in the world so it can recognize things and know what to do when you're in a place as opposed to just giving you stuff that doesn't seem aware of where you are. And also there's just another element of like, will this ever feel normal? There's been times where having a smartphone in our hand wasn't normal, but I'm just wondering what Apple could do so maybe people go, oh yeah, I do want tech on my face because I don't know if we're there yet. It's like everything gets weird until suddenly everyone's doing it sometimes.

Duration: 32.2s
Scene 54 (9.301s)
6:21 → 6:31
Frame 1
Frame 1 • 6:22
Frame 2
Frame 2 • 6:26
Frame 3
Frame 3 • 6:30
🎬 Visual: In this scene, a person wearing AirPods is shown standing outdoors in a park, highlighting the appearance of the wireless earbuds. The scene then cuts to a screenshot of a Reddit post humorously comparing AirPods to another object through side-by-side images. The context shifts from a real-life demonstration to online commentary, with the subject and the AirPods remaining central throughout.
🎙️ Dialogue:

I think about airpods, like I remember way back when I wore airpods the first time. It was a meme, everyone thought it was absurd, um, they were like get those cigarette butts out of your ears and then everybody's wearing them.

Duration: 9.3s
Scene 55 (2.794s)
6:31 → 6:33
Frame 1
Frame 1 • 6:31
Frame 2
Frame 2 • 6:32
Frame 3
Frame 3 • 6:33
🎬 Visual: In this scene, the video displays an article headline and byline from CNET, focusing on the topic of Apple's AirPods one year after their release. The main subject is the article discussing how AirPods evolved from being joked about to becoming essential technology, and poses questions about the future of Apple's wireless headphones. The frames show the webpage remaining unchanged throughout the scene, suggesting the viewer is meant to read and absorb the article's headline and introduction.
Scene 56 (3.087s)
6:33 → 6:37
Frame 1
Frame 1 • 6:34
Frame 2
Frame 2 • 6:35
Frame 3
Frame 3 • 6:36
🎬 Visual: In this scene, an individual with red hair at an event or expo is shown picking up and putting on a pair of smart glasses or AR glasses. The setting includes multiple people and technology-related booths, indicated by branded signage and tech displays in the background. Over the three frames, the key action is the subject lifting the glasses toward their face and then positioning them on their head, while the scene context remains consistent.
🎙️ Dialogue:

It's already getting more normal to wear smart glasses, even though there's a...

Duration: 3.1s
Scene 57 (2.544s)
6:37 → 6:39
Frame 1
Frame 1 • 6:37
Frame 2
Frame 2 • 6:38
Frame 3
Frame 3 • 6:39
🎬 Visual: In this scene, a person is seen putting on a pair of Ray-Ban Wayfarer glasses, starting by holding the glasses near their face and then settling them into position. The background features a colorful mural on a brick wall, suggesting an urban or outdoor setting. The sequence highlights the action of wearing the glasses, with the subject's hand lowering away by the final frame.
🎙️ Dialogue:

with the idea of

Duration: 2.5s
Scene 58 (2.502s)
6:39 → 6:42
Frame 1
Frame 1 • 6:39
Frame 2
Frame 2 • 6:40
Frame 3
Frame 3 • 6:41
🎬 Visual: A person sits on a couch using a laptop, wearing XREAL Air 2 Pro smart glasses, as indicated by the label on-screen. The setting is a casual indoor space with bookshelves and plants in the background. Throughout the three frames, the individual remains in a similar position, focused on the laptop, with no notable changes in action or environment.
🎙️ Dialogue:

and data, and are you recording me, et cetera.

Duration: 2.5s
Scene 59 (2.711s)
6:42 → 6:44
Frame 1
Frame 1 • 6:42
Frame 2
Frame 2 • 6:43
Frame 3
Frame 3 • 6:44
🎬 Visual: In this scene, a man wearing a gray shirt and a Ray-Ban device is walking along a city sidewalk adjacent to a brick building, with other pedestrians and seated people visible in a small park area to the left. Throughout the frames, the man moves steadily closer to the camera, while the background features urban elements like construction cones and scaffolding, with pedestrian activity remaining visible but shifting. The main change across the frames is the man’s increasing proximity to the camera, emphasizing his movement through the urban environment.
🎙️ Dialogue:

people's reactions to me wearing these out in public.

Duration: 2.7s
Scene 60 (2.503s)
6:44 → 6:47
Frame 1
Frame 1 • 6:45
Frame 2
Frame 2 • 6:46
Frame 3
Frame 3 • 6:47
🎬 Visual: In this scene, close-up shots focus on the side of a person's head, specifically highlighting the ear and a white wearable device, which is Google Glass. The framing remains nearly identical throughout the three frames, with no significant changes in subject position or context. The setting is neutral, and the scene emphasizes the device's design, accompanied by on-screen text referencing "Google Glass 2013" and the "CNET" logo.
🎙️ Dialogue:

is a lot less intense than Google Glass Days.

Duration: 2.5s
Scene 61 (10.218s)
6:47 → 6:57
Frame 1
Frame 1 • 6:48
Frame 2
Frame 2 • 6:52
Frame 3
Frame 3 • 6:56
🎬 Visual: In this scene, a woman initially gestures as she speaks in front of a purple backdrop. Partway through, a man enters the frame and they engage in an animated conversation, both using expressive hand gestures. The scene transitions from a solo presentation to a collaborative discussion, with clear interaction and engagement between the two subjects.
🎙️ Dialogue:

I can't even tell that these have display screens. I don't know when you're looking at some computer overlay on top of me and- Well, that's what's surprising about it.

Duration: 10.2s
Scene 62 (2.044s)
6:57 → 6:59
Frame 1
Frame 1 • 6:57
Frame 2
Frame 2 • 6:58
Frame 3
Frame 3 • 6:59
🎬 Visual: In this scene, a person wearing a light gray, patterned shirt is seated in front of a dark, metallic background. Over the three frames, the main action is the individual turning their head from facing forward to facing left, with the addition of eyeglasses visible by the middle frame. The setting remains consistent throughout, with no other significant changes in objects or context.
🎙️ Dialogue:

You basically are...

Duration: 2.0s
Scene 63 (2.586s)
6:59 → 7:02
Frame 1
Frame 1 • 6:59
Frame 2
Frame 2 • 7:00
Frame 3
Frame 3 • 7:01
🎬 Visual: In this scene set against a purple background, a person is shown in profile, wearing a grey polka-dot shirt, glasses, and a smartwatch. Over the three frames, the key action is the person's hand shifting position slightly while resting against their chin, suggesting a thoughtful or contemplative gesture. The scene remains static with no changes in setting or camera angle, focusing on the subtle movement of the hand and wrist.
Scene 64 (3.545s)
7:02 → 7:05
Frame 1
Frame 1 • 7:02
Frame 2
Frame 2 • 7:03
Frame 3
Frame 3 • 7:05
🎬 Visual: In Scene 64, the video presents a close-up sequence of smart glasses against a purple background. The frames progress from an abstract view of the glasses' arm, to a detailed shot revealing the Meta and Ray-Ban logos and a camera module, and finally to an even closer focus on the right lens with branding in view. This scene highlights the glasses' design features and branding, gradually shifting attention from the structure to specific functional and brand details.
🎙️ Dialogue:

is only a small area where you can see the reflection. Do you find?

Duration: 3.5s
Scene 65 (6.674s)
7:05 → 7:12
Frame 1
Frame 1 • 7:06
Frame 2
Frame 2 • 7:09
Frame 3
Frame 3 • 7:11
🎬 Visual: Two people are standing and conversing in front of a purple studio background, with visible CNET branding. The woman on the left is actively gesturing with her hands throughout the scene, indicating she is explaining or emphasizing a point, while the man on the right listens attentively. The main change across the frames is the progression of the woman's hand gestures, highlighting her role as the primary speaker in this segment.
🎙️ Dialogue:

that you're kind of venturing into new territory of ethics. Like when you're like, when, when is it right?

Duration: 6.7s
Scene 66 (2.877s)
7:12 → 7:15
Frame 1
Frame 1 • 7:12
Frame 2
Frame 2 • 7:13
Frame 3
Frame 3 • 7:14
🎬 Visual: In Scene 66, a digital map interface is displayed through smart glasses, initially showing nearby locations such as Webster Hall and Katz’s Delicatessen. As the scene progresses, the focus shifts to highlight "La Colombe Coffee Workshop," providing detailed information including its rating, distance, and hours. The background remains consistent throughout, while the digital overlay changes to emphasize a specific destination and its details.
🎙️ Dialogue:

to be activating your smart glasses.

Duration: 2.9s
Scene 67 (3.712s)
7:15 → 7:18
Frame 1
Frame 1 • 7:15
Frame 2
Frame 2 • 7:17
Frame 3
Frame 3 • 7:18
🎬 Visual: In this scene, two individuals stand and interact in front of a purple backdrop, engaging in conversation; one gestures with her hands while the other listens. The setting is a studio environment, indicated by professional attire and the "CNET" logo. By the final frame, the focus shifts to a close-up of the man as he looks to his left, suggesting a transition or end to the exchange.
Scene 68 (3.087s)
7:18 → 7:22
Frame 1
Frame 1 • 7:19
Frame 2
Frame 2 • 7:20
Frame 3
Frame 3 • 7:21
🎬 Visual: A person wearing glasses and a light blue patterned shirt is seated in front of a dark metallic background, positioned at a table. At the beginning of the scene, the subject is turned slightly to the side, then gradually faces forward by the final frame. The setting remains consistent throughout, with minimal movement or change in context.
🎙️ Dialogue:

somebody else or an AI and now it's like

Duration: 3.1s
Scene 69 (15.015s)
7:22 → 7:37
Frame 1
Frame 1 • 7:23
Frame 2
Frame 2 • 7:29
Frame 3
Frame 3 • 7:35
🎬 Visual: In this scene, two individuals are engaged in a discussion while standing against a purple gradient backdrop, with CNET branding visible in the corner. At the beginning, both subjects appear to be conversing, followed by a close-up focused on the man, and concluding with both returning to a wider shot, gesturing as the conversation continues. The main actions involve dialogue and expressive hand movements, highlighting an interactive and possibly informative exchange.
🎙️ Dialogue:

Scott could be talking like simultaneously and where is he where's Scott are you in the moment am I ever in the moment I yes but I do think about that and I think that when you bring up the display or like this sorry

Duration: 15.0s
Scene 70 (22.564s)
7:37 → 7:59
Frame 1
Frame 1 • 7:39
Frame 2
Frame 2 • 7:48
Frame 3
Frame 3 • 7:57
🎬 Visual: In Scene 70, the video begins with a close-up shot looking through smart glasses, displaying a digital image with a purple backdrop. This is followed by two presenters standing and discussing in front of a gradient purple background, likely explaining or reviewing technology. The scene concludes with a product spotlight featuring an Apple Watch Hermès edition, highlighting its unique watch face and band design.
🎙️ Dialogue:

Yeah, you're not in the moment. You know, you have a phone on your face. Do you think Apple needs to partner with a luxury eyeglass designer, a frame maker, or could Apple just do it on their own? It could go either way, I think. Yeah, I mean, it could be like the Apple Watch bands. You know, it could be like a both. Like they could do in-house design and then have like the Hermes they seem to love, you know. Like they'll trot that out or something.

Duration: 22.6s
Scene 71 (4.129s)
7:59 → 8:03
Frame 1
Frame 1 • 8:00
Frame 2
Frame 2 • 8:01
Frame 3
Frame 3 • 8:03
🎬 Visual: In this scene, a person sits in front of a purple background, initially facing left and remaining relatively still in the first two frames. The scene then transitions to a screen showing the Meta Ray-Ban Display product page, highlighting augmented reality glasses with the caption "See more—without ever looking away." The key shift is from an on-camera discussion or demonstration to a web page that presents details about the featured smart glasses.
Scene 72 (7.299s)
8:03 → 8:11
Frame 1
Frame 1 • 8:04
Frame 2
Frame 2 • 8:07
Frame 3
Frame 3 • 8:10
🎬 Visual: The scene starts with a website showing options to schedule a demo for Meta Ray-Ban Display and Meta Neural Band, followed by an in-person demonstration where a woman puts on an Apple Vision Pro headset at a retail store. The key subjects include the Apple Vision Pro device and demonstration table, with another individual present to assist. Across the frames, the action shifts from navigating online demo appointments to hands-on testing of the device, illustrating a transition from digital scheduling to physical product interaction.
🎙️ Dialogue:

displays have their setting up demo locations. Apple has the Apple stores. Like I thought for years like that would be their Warby Parker, you know, spot.

Duration: 7.3s
Scene 73 (18.185s)
8:11 → 8:29
Frame 1
Frame 1 • 8:12
Frame 2
Frame 2 • 8:20
Frame 3
Frame 3 • 8:27
🎬 Visual: In this scene, two people appear in a studio with a purple background, engaging in a discussion or demonstration. The first frame shows a man gesturing, followed by a woman speaking in the second frame; in the final frame, the woman is holding a pair of black eyeglasses, indicating a transition or exchange of an object. The main notable change across the frames is the shift of focus from the man to the woman and the introduction of the eyeglasses as a key object.
🎙️ Dialogue:

maybe not get your eyeglass fitting completely there, but you could at least kind of virtually try them out online and then look at models in the store. Can I hold them? Yes. Did you want to try them? Because they don't have prescriptions in them. I'm wearing contacts to test them. Okay, I'm wearing contacts too, so.

Duration: 18.2s
Scene 74 (5.297s)
8:29 → 8:34
Frame 1
Frame 1 • 8:29
Frame 2
Frame 2 • 8:31
Frame 3
Frame 3 • 8:33
🎬 Visual: In this scene, two presenters stand in front of a purple studio backdrop and engage in conversation, with one person gesturing with their hand while speaking. The sequence focuses initially on both presenters, transitions to a close-up of the woman as she appears to speak or react, and then returns to a wider view of both individuals interacting. The CNET logo is visible in each frame, indicating a branded studio production setting.
🎙️ Dialogue:

So I will activate. Ah! Do you see it? Yes. Yeah, it just kind of floats there.

Duration: 5.3s
Scene 75 (20.834s)
8:34 → 8:55
Frame 1
Frame 1 • 8:36
Frame 2
Frame 2 • 8:44
Frame 3
Frame 3 • 8:53
🎬 Visual: In this scene, a woman initially interacts with another person while holding a pair of glasses, then transitions to presenting or explaining something alone at a table. The setting remains consistent, featuring a purple backdrop and CNET branding. The main change between frames is the shift from a two-person interaction to a solo presentation focused on the woman at the table.
🎙️ Dialogue:

Oh boy. Well, thank you for putting this into focus for us. Yeah, of course. Let us know your big questions and be sure to check out Scott's coverage of what it's like to use it and follow his experiences with it. And if you are genuinely into Apple doing smart glasses, I want to hear about why in the comments. And I'll catch you next time. There's one more thing.

Duration: 20.8s