
TensorRT 10.5.0 -- CPU Memory leak while using nvinfer1::createInferBuilder on 4060 #4281

Closed
Iridium771110 opened this issue Dec 12, 2024 · 3 comments
Assignees
Labels
Engine Build (issues with engine build), triaged (issue has been triaged by maintainers)

Comments

@Iridium771110


Description

When I use `nvinfer1::IBuilder* builder = nvinfer1::createInferBuilder(logger)` to create a model engine, there appears to be a CPU memory leak.
The full code is shown below.

#include <iostream>
#include <string>
#include <cstring>
#include <cstdio>
#include <cstdlib>
#include <cctype>
#include <cuda_runtime.h>
#include <NvInfer.h>

class Logger : public nvinfer1::ILogger{
    void log(Severity severity, const char* msg) noexcept override{
        if (severity <= Severity::kWARNING) std::cout<<msg<<std::endl;
    }
};

size_t physical_memory_used_by_process(){
    // Parse the "VmRSS:   <n> kB" field from /proc/self/status (Linux only).
    FILE* file = fopen("/proc/self/status", "r");
    size_t result = 0;
    char line[128];
    while (fgets(line, sizeof(line), file) != nullptr) {
        if (strncmp(line, "VmRSS:", 6) == 0) {
            // Skip the label and any leading whitespace before the number.
            const char* p = line + 6;
            while (!std::isdigit(static_cast<unsigned char>(*p))) ++p;
            result = atoi(p); // atoi stops at the space before "kB"
            break;
        }
    }
    fclose(file);
    return result; // resident set size in kB
}

int main(int argc, char *argv[]){
    size_t gpu_total_byte_mem;
    size_t gpu_free_byte_mem;
    size_t gpu_alloced_byte_mem;
    cudaDeviceSynchronize();
    cudaMemGetInfo(&gpu_free_byte_mem, &gpu_total_byte_mem);
    gpu_alloced_byte_mem = gpu_total_byte_mem - gpu_free_byte_mem;
    std::cout<<"init status"<<std::endl;
    std::cout<<"used gpu mem: "<<double(gpu_alloced_byte_mem) / 1024.0 / 1024.0<<"MB"<<std::endl;
    std::cout<<"used cpu mem: "<<double(physical_memory_used_by_process()) / 1024.0 <<"MB"<<std::endl;

    Logger logger;

    cudaDeviceSynchronize();
    cudaMemGetInfo(&gpu_free_byte_mem, &gpu_total_byte_mem);
    gpu_alloced_byte_mem = gpu_total_byte_mem - gpu_free_byte_mem;
    std::cout<<"logger created"<<std::endl;
    std::cout<<"used gpu mem: "<<double(gpu_alloced_byte_mem) / 1024.0 / 1024.0<<"MB"<<std::endl;
    std::cout<<"used cpu mem: "<<double(physical_memory_used_by_process()) / 1024.0 <<"MB"<<std::endl;

    nvinfer1::IBuilder* builder = nvinfer1::createInferBuilder(logger);

    cudaDeviceSynchronize();
    cudaMemGetInfo(&gpu_free_byte_mem, &gpu_total_byte_mem);
    gpu_alloced_byte_mem = gpu_total_byte_mem - gpu_free_byte_mem;
    std::cout<<"builder created"<<std::endl;
    std::cout<<"used gpu mem: "<<double(gpu_alloced_byte_mem) / 1024.0 / 1024.0<<"MB"<<std::endl;
    std::cout<<"used cpu mem: "<<double(physical_memory_used_by_process()) / 1024.0 <<"MB"<<std::endl;

    delete builder;

    cudaDeviceSynchronize();
    cudaMemGetInfo(&gpu_free_byte_mem, &gpu_total_byte_mem);
    gpu_alloced_byte_mem = gpu_total_byte_mem - gpu_free_byte_mem;
    std::cout<<"builder deleted"<<std::endl;
    std::cout<<"used gpu mem: "<<double(gpu_alloced_byte_mem) / 1024.0 / 1024.0<<"MB"<<std::endl;
    std::cout<<"used cpu mem: "<<double(physical_memory_used_by_process()) / 1024.0 <<"MB"<<std::endl;

    return 0;
}

After building and running the code, the printed memory usage is:
[screenshot of the program's memory output]
So it looks like a CPU memory leak in the `createInferBuilder` API or the `IBuilder` pointer.
Do you have any suggestions? Thanks.

Environment

TensorRT Version: 10.5.0

NVIDIA GPU: 4060

NVIDIA Driver Version: 560.35.03

CUDA Version: 12.4

CUDNN Version: --

Operating System:

Python Version (if applicable):

Tensorflow Version (if applicable):

PyTorch Version (if applicable):

Baremetal or Container (if so, version):

Steps To Reproduce

Build the code and run it.

@lix19937

You should use Valgrind to check the memory management.

@asfiyab-nvidia added the Engine Build and triaged labels on Dec 23, 2024
@asfiyab-nvidia
Collaborator

@zhenhuaw-me can you please look into this?

@zhenhuaw-me
Member

@Iridium771110 Thank you for reporting this issue. Some CUDA resources, such as kernels, are only released when the process terminates, so this measurement is inaccurate.

As @lix19937 suggested, if you think there is a memory leak, please use Valgrind to detect and confirm it.

I am going to close this ticket since it's a false alarm. Feel free to reopen if any further concerns.
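A typical Valgrind invocation for this kind of check might look like the following (a sketch only: the binary name `trt_builder_test` and the library paths are illustrative, and must be adapted to your TensorRT and CUDA install):

```shell
# Build the repro (paths are illustrative; adjust to your setup).
g++ -o trt_builder_test main.cpp -I/usr/local/cuda/include \
    -L/usr/local/cuda/lib64 -lnvinfer -lcudart

# Run under Valgrind. "definitely lost" blocks indicate a true leak;
# "still reachable" memory at exit is usually one-time lazy initialization
# by the CUDA/TensorRT runtime and is released at process termination.
valgrind --leak-check=full --show-leak-kinds=definite ./trt_builder_test
```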
