
Issue with Saving Depth Information in PNG Frames and Reading Unexpected Values #12564

Closed
ronbitonn opened this issue Jan 10, 2024 · 10 comments

@ronbitonn

ronbitonn commented Jan 10, 2024

Problem Description:
I am currently facing an issue with my code for capturing RGB and depth information using an Intel D435 camera. While my code successfully saves frames as PNG files, I encounter unexpected values when attempting to read the depth information. The depth array appears to represent RGB values instead of actual depth values.

Initial Code for Capturing RGB and Depth Frames:

import pyrealsense2 as rs
import cv2
import numpy as np
import time
import datetime
import os

# Function to visualize the depth image as a colored image based on distance values.
def visualize_depth_as_color(depth_image):
    depth_colormap = cv2.applyColorMap(cv2.convertScaleAbs(depth_image, alpha=0.03), cv2.COLORMAP_JET)
    return depth_colormap

# Constants for recording
NUM_RECORDINGS = 480
BREAK_TIME = 2
OUTPUT_DIRECTORY = "/data/records/-----"
width_image = 1280
height_image = 720
video_length = 60

# Initialize context for RealSense and query for devices
ctx = rs.context()
devices = ctx.query_devices()

# Check if there's at least one device connected
if len(devices) >= 1:
    # Use the first detected device
    first_device = devices[0]
    # Get the serial number of the device for identification
    first_device_serial_number = first_device.get_info(rs.camera_info.serial_number)

    # Initialize pipeline for recording
    pipeline_record = rs.pipeline()
    config_record = rs.config()
    # Configure the device and streams for recording
    config_record.enable_device(first_device_serial_number)
    config_record.enable_stream(rs.stream.color, width_image, height_image, rs.format.rgb8, 30)
    config_record.enable_stream(rs.stream.depth, width_image, height_image, rs.format.z16, 30)

    # Access the color sensor and set its options
    color_sensor_record = first_device.query_sensors()[1]
    color_sensor_record.set_option(rs.option.auto_exposure_priority, False)
    color_sensor_record.set_option(rs.option.sharpness, 100)

    # Record multiple times as per NUM_RECORDINGS
    for i in range(NUM_RECORDINGS):
        now = datetime.datetime.now()
        # Create a unique timestamp for the recording
        timestamp = now.strftime("%Y%m%d_%H%M%S_%f")
        # Set up the directory to save the frames
        output_directory = os.path.join(OUTPUT_DIRECTORY, first_device_serial_number, timestamp)
        os.makedirs(output_directory, exist_ok=True)

        # Start the recording pipeline
        pipeline_record_profile = pipeline_record.start(config_record)

        try:
            # Pause to let the camera initialize properly
            print('Pausing.', end='')
            camera_startup_pause = 2
            for t in range(camera_startup_pause * 2):
                print('\r', 'Pausing', '.' * (t + 2), sep='', end='')
                time.sleep(0.5)
            print()

            # Start recording and displaying frames
            start = time.time()
            frame_index = 0
            while time.time() - start < int(video_length):
                frames = pipeline_record.wait_for_frames()
                # Get RGB and Depth frames
                color_frame = frames.get_color_frame()
                depth_frame = frames.get_depth_frame()

                # Skip if any of the frames are missing
                if not color_frame or not depth_frame:
                    continue

                # Convert frames to numpy arrays for processing and visualization
                color_image = np.asanyarray(color_frame.get_data())
                color_image_rgb = cv2.cvtColor(color_image, cv2.COLOR_BGR2RGB)
                depth_image = np.asanyarray(depth_frame.get_data())
                # Colorize the depth image for visualization
                depth_colormap = visualize_depth_as_color(depth_image)

                # Display the frames
                cv2.imshow('Depth Color', depth_colormap)
                cv2.imshow('RGB', color_image_rgb)

                # Save frames at every 5th iteration (modifiable)
                if frame_index % 5 == 0:
                    timestamp_per_frame = datetime.datetime.now().strftime("%Y%m%d_%H%M%S_%f")
                    filename_rgb = f"frame_rgb_{frame_index}_{timestamp_per_frame}.png"
                    filename_depth = f"frame_depth_{frame_index}_{timestamp_per_frame}.png"

                    cv2.imwrite(os.path.join(output_directory, filename_rgb), color_image_rgb)
                    cv2.imwrite(os.path.join(output_directory, filename_depth), depth_colormap)

                # Break out of loop if 'q' is pressed
                if cv2.waitKey(1) & 0xFF == ord('q'):
                    break

                frame_index += 1

        finally:
            # Stop the recording pipeline after recording is done or an error occurs
            pipeline_record.stop()

        # Print the directory where the frames are saved
        print(f"Recorded frames saved to: {output_directory}")

        # Pause for a short time before next recording
        time.sleep(BREAK_TIME)

# Close all OpenCV windows after recordings are done
cv2.destroyAllWindows()

Original Code for Reading Depth Information:

For context, here is the code for reading depth information from saved frames

import cv2
import os
import numpy as np

DEPTH_DIRECTORY = list_false_visit[1]
depth_files = sorted(os.listdir(DEPTH_DIRECTORY))

for depth_file in depth_files[:3]:
    depth_image_path = os.path.join(DEPTH_DIRECTORY, depth_file)
    depth_image = cv2.imread(depth_image_path, cv2.IMREAD_UNCHANGED)

    print(f"Depth information for the first 10 pixels in {depth_file}:")

    for y in range(min(10, depth_image.shape[0])):
        for x in range(min(10, depth_image.shape[1])):
            depth_value = depth_image[y, x]
            print(f"Pixel ({x}, {y}): Depth = {depth_value} mm")

Output:

Depth information for the first 10 pixels in frame_depth_0725_20231003_143459_016758.png:
Pixel (0, 0): Depth = [128 0 0] mm
Pixel (1, 0): Depth = [128 0 0] mm
Pixel (2, 0): Depth = [128 0 0] mm
...
Desired Outcome:
I expect to retrieve the actual depth values for each pixel instead of an array of RGB values.

Recommendations Sought:
I am considering two alternatives:

  1. Recording frames as a ROSbag file, then converting RGB frames to PNG and saving depth information as a text file for each frame. However, the ROSbag files are excessively large (around 10 GB per minute).
  2. Finding a way to convert the depth array read from the PNG files back into actual depth values for each pixel.

I would appreciate guidance or recommendations on the best approach for achieving my goal while managing file size efficiently.

Please review the issue and assist in resolving the problem with reading unexpected values from the depth frames.

@MartyG-RealSense
Collaborator

Hi @ronbitonn When saving depth data to PNG files, most of the depth information is lost. There is no way to avoid this when saving to PNG, unfortunately. If possible it is better to save to the .raw image format which preserves depth information.

The best RealSense reference about reading depth data from a .raw file (which follows the UVC specification for the depth stream) is at #2231 (comment)
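The reading side of that approach could be sketched as follows (an assumption-laden sketch, not the linked script: it presumes a headerless dump of little-endian 16-bit z16 values at the stream resolution, and the filename and dimensions are hypothetical):

```python
import numpy as np

def read_raw_depth(path, width=1280, height=720):
    # Assumes the .raw file is a headerless dump of width*height
    # little-endian uint16 depth values (z16); adjust the dimensions
    # to match the recorded stream profile.
    depth = np.fromfile(path, dtype=np.uint16)
    return depth.reshape(height, width)

# Example: write a synthetic frame to disk and read it back.
synthetic = np.arange(1280 * 720, dtype=np.uint16).reshape(720, 1280)
synthetic.tofile("depth_test.raw")
restored = read_raw_depth("depth_test.raw")
assert np.array_equal(restored, synthetic)
```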

@ronbitonn
Author

Hi @MartyG-RealSense ,

I hope this message finds you well. I've been facing an issue with capturing RGB and depth information using the Intel D435 camera and saving them as PNG files. When attempting to read the depth information from these PNG files, I encounter unexpected values representing RGB triplets instead of the actual depth values.

After reviewing your guidance on the limitations of PNG for preserving depth information, I'm seeking recommendations on how to efficiently save both RGB and depth data for subsequent use in object detection algorithms.

My goal is to save the PNG files for RGB frames and, for each frame, save a corresponding matrix of depth information in the size of the RGB image. Ultimately, I aim to create a folder with X*60 frames, along with an equal number of text files containing depth information. This structure will enable me to correlate depth information with RGB data, facilitating the development of algorithms for object detection.

Could you kindly advise on the best approach for achieving this goal while ensuring that depth information is accurately preserved for subsequent analysis?

Thank you in advance for your assistance.

@MartyG-RealSense
Collaborator

MartyG-RealSense commented Jan 11, 2024

#4934 (comment) has a Python script for saving RGB as a PNG image and depth as an array of scaled matrices saved as an .npy file.
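The general shape of that approach can be sketched as below (this is not the linked script; the array is a synthetic stand-in for `np.asanyarray(depth_frame.get_data())`):

```python
import numpy as np

# Synthetic stand-in for the uint16 depth array a real frame would give.
depth_image = np.random.randint(0, 6000, size=(720, 1280), dtype=np.uint16)

# Save alongside the RGB PNG; .npy preserves dtype and shape exactly.
np.save("frame_depth_0.npy", depth_image)

# Later: recover the exact raw depth values (in depth units, 1 mm by
# default on the D435 unless the depth scale was changed).
restored = np.load("frame_depth_0.npy")
assert restored.dtype == np.uint16
assert np.array_equal(restored, depth_image)
```

To convert to metres, the values would be multiplied by the depth scale reported by `depth_sensor.get_depth_scale()` (0.001 by default on the D435).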

@MartyG-RealSense
Collaborator

Hi @ronbitonn Do you require further assistance with this case, please? Thanks!

@MartyG-RealSense
Collaborator

Case closed due to no further comments received.

@ronbitonn
Author

ronbitonn commented Feb 10, 2024

Subject: Performance Issue in Bag File Conversion to JPEG and Depth Arrays with Jetson Orin and RealSense D435 Camera

Issue:

While using the provided scripts with a Jetson Orin and a RealSense D435 camera, there is a noticeable performance concern during the conversion of rosbag files to JPEG images and depth arrays. The current implementation takes approximately 30 seconds for the conversion of one minute of recorded data, which is suboptimal for real-time or large-scale applications.

Platform Details:

Jetson Orin
Camera: Intel RealSense D435
Problem Description:

The observed bottleneck lies in the code responsible for processing and converting the frames from the rosbag files to JPEG images and depth arrays. This performance concern might impact the scalability and real-time usability of the solution on the Jetson Orin platform.

Code (run with `python3 record_multiple_bagfiles_d435.py`):

import os
import pyrealsense2 as rs
import cv2
import numpy as np
import time
import datetime

import rosbag

NUM_RECORDINGS = 20
BREAK_TIME = 2
video_length = 60

ctx = rs.context()
devices = ctx.query_devices()

if len(devices) >= 1:
    first_device = devices[0]
    first_device_serial_number = first_device.get_info(rs.camera_info.serial_number)

    pipeline_record = rs.pipeline()
    config_record = rs.config()
    config_record.enable_device(first_device_serial_number)

    config_record.enable_stream(rs.stream.color, 1280, 720, rs.format.rgb8, 30)
    config_record.enable_stream(rs.stream.depth, 1280, 720, rs.format.z16, 30)

    color_sensor_record = first_device.query_sensors()[1]
    color_sensor_record.set_option(rs.option.auto_exposure_priority, False)
    color_sensor_record.set_option(rs.option.sharpness, 100)

    for i in range(NUM_RECORDINGS):
        now = datetime.datetime.now()
        timestamp = now.strftime("%Y%m%d_%H%M%S_%f")
        output_filename = f"/data/records/records_ivri_farm/{timestamp}"
        bag_filename = output_filename + ".bag"
        config_record.enable_record_to_file(bag_filename)

        pipeline_record_profile = pipeline_record.start(config_record)
        device_record = pipeline_record_profile.get_device()
        device_recorder = device_record.as_recorder()

        color_sensor_record = device_record.query_sensors()[1]
        color_sensor_record.set_option(rs.option.auto_exposure_priority, False)

        try:
            rs.recorder.pause(device_recorder)
            print('Pausing.', end='')
            camera_startup_pause = 2
            for t in range(camera_startup_pause * 2):
                print('\r', 'Pausing', '.' * (t + 2), sep='', end='')
                time.sleep(0.5)

            print()
            rs.recorder.resume(device_recorder)

            colorizer = rs.colorizer()

            start = time.time()
            while time.time() - start < int(video_length):
                frames = pipeline_record.wait_for_frames()
                frames.keep()

                color_frame = frames.get_color_frame()
                depth_frame = frames.get_depth_frame()

                color_frame = np.asanyarray(color_frame.get_data())
                color_frame = cv2.cvtColor(color_frame, cv2.COLOR_RGB2BGR)
                depth_frame = np.asanyarray(colorizer.colorize(depth_frame).get_data())

                camera_images = np.vstack((color_frame, depth_frame))

                camera_pair = first_device_serial_number
                cv2.namedWindow('Camera ' + str(camera_pair), cv2.WINDOW_AUTOSIZE)
                cv2.imshow('Camera ' + str(camera_pair), camera_images)
                cv2.waitKey(5)

        finally:
            pipeline_record.stop()
            cv2.destroyAllWindows()

        print(f"Recorded video saved to: {bag_filename}")

        time.sleep(BREAK_TIME)

Second script (bag-to-JPEG/depth-array conversion):

#!/usr/bin/env python3

import os
import numpy as np
import cv2
import rosbag
from datetime import datetime, timedelta

bag_directory = "/data/records/records_ivri_farm/new_31_01_2024"

for bag_file in os.listdir(bag_directory):
    if bag_file.endswith(".bag"):
        bag_path = os.path.join(bag_directory, bag_file)

        pipeline = rosbag.Bag(bag_path)

        bag_file_name = os.path.splitext(os.path.basename(bag_path))[0]
        date_str = bag_file_name[:8]
        start_time_str = bag_file_name[9:15]

        def add_seconds_to_start_time(start_time_str, seconds):
            start_time = datetime.strptime(start_time_str, "%H%M%S")
            new_time = start_time + timedelta(seconds=seconds)
            return new_time.strftime("%H%M%S")

        output_dir = f"/data/records/records_ivri_farm/new_31_01_2024{bag_file_name}"
        os.makedirs(output_dir, exist_ok=True)
        print("create output_dir - " + output_dir)

        playback_speed = 5
        color_frame_count = 0
        depth_frame_count = 0

        for topic, msg, t in pipeline.read_messages():
            timestamp_seconds = t.to_sec()
            adjusted_time_str = add_seconds_to_start_time(start_time_str, timestamp_seconds)

            if topic == "/device_0/sensor_1/Color_0/image/data":
                color_image = np.frombuffer(msg.data, dtype=np.uint8).reshape(msg.height, msg.width, -1)

                if color_frame_count % 6 == 0:
                    color_image_path = f"{output_dir}/{date_str}_{adjusted_time_str}_{color_frame_count}_color.jpg"
                    cv2.imwrite(color_image_path, cv2.cvtColor(color_image, cv2.COLOR_BGR2RGB))

                color_frame_count += 1

            elif topic == "/device_0/sensor_0/Depth_0/image/data":
                depth_image = np.frombuffer(msg.data, dtype=np.uint16).reshape(msg.height, msg.width)

                if depth_frame_count % 6 == 0:
                    depth_npy_path = f"{output_dir}/{date_str}_{adjusted_time_str}_{depth_frame_count}_depth.npy"
                    np.save(depth_npy_path, depth_image)

                depth_frame_count += 1

        pipeline.close()

@MartyG-RealSense
Collaborator

Hi @ronbitonn If you increase the frame queue size then the pipeline will be able to contain more frames simultaneously (the default maximum pipeline capacity is 16 frames for each stream type), though this increases the amount of computer memory that is consumed. Python code for setting the frame queue size can be found at #6448 (comment)
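The buffering idea can be illustrated in plain Python terms (an analogy only, using the standard library rather than pyrealsense2; with the RealSense SDK itself the equivalent is constructing an `rs.frame_queue` with a larger capacity, as the linked comment shows):

```python
import queue

# A bigger buffer between a fast producer (the camera) and a slow
# consumer (disk I/O) absorbs bursts instead of dropping items.
buf = queue.Queue(maxsize=64)  # cf. the default pipeline capacity of 16
dropped = 0

def produce(n):
    # The producer never blocks; items are dropped when the buffer is full.
    global dropped
    for i in range(n):
        try:
            buf.put_nowait(i)
        except queue.Full:
            dropped += 1

produce(50)
consumed = []
while not buf.empty():
    consumed.append(buf.get())

assert dropped == 0       # 50 items fit in a 64-slot buffer
assert len(consumed) == 50
```

With a `maxsize` of 16, the same burst of 50 would drop 34 items; the trade-off, as noted above, is the extra memory the larger buffer consumes.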

@ronbitonn
Author

Hi @MartyG-RealSense

I appreciate your suggestion on increasing the frame queue size. Currently, due to storage limitations, I record as a rosbag file first and then convert to JPEGs and depth arrays separately. Any recommendations for a more efficient approach?

Additionally, if there's an option to reduce the number of frames and depth arrays saved per second in the rosbag file, it would be beneficial for storage optimization on my Jetson Orin.

Thanks for your advice.

@MartyG-RealSense
Collaborator

Rosbags are the most efficient method of writing multiple frames of RealSense data.

A way to set a custom frame rate is to only use every 'nth' frame (e.g. every 5th frame out of 30 when using 30 FPS, to simulate 6 FPS). A Python script that demonstrates this technique is at #3169
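The decimation idea in isolation looks like this (a sketch; in a real loop the same modulo test would be applied to a counter incremented per `wait_for_frames()` call):

```python
# Keep every 5th frame of a 30 FPS stream for an effective ~6 FPS.
SKIP = 5
kept = [frame_index for frame_index in range(30) if frame_index % SKIP == 0]

print(len(kept))  # 6 frames kept out of 30
print(kept)       # [0, 5, 10, 15, 20, 25]
```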

@ronbitonn
Author

OK @MartyG-RealSense, thank you very much. I will check it out and update soon. I appreciate your help a lot.
