AFTER Google Summer of Code 2024: Google Protocol Buffers Technology

Saif Kandil

Just a place for me to dump my thoughts about my Google Summer of Code 2024 project


Oct 13, 2024


This blog post is related to my Google Summer of Code 2024 project: Procedural Fragment Shader Generation Using Classic Machine Learning.

I have learned a lot, including the right way to write code. I can now say that I write better code, because I no longer care about how good it looks. I care about how easily others can understand it.

You know what the problem with my work was? I didn’t follow ENIGMA’s way of doing things. To explain this, consider the code snippet below:

struct Graph {
    std::map<int, VisualShader::Node> nodes;
    std::vector<VisualShader::Connection> connections;
} graph;

This is where I save my nodes and my connections. All of this is stored dynamically, in memory. However, in ENIGMA, we should be able to serialize and deserialize this data to and from a file called the “project file”. Got it? RadialGM (RGM) is an IDE that is able to load and save full projects. How does ENIGMA do it? Well, it uses the Model-View-Controller (MVC) architecture. But that’s not everything. It also uses Google Protocol Buffers as its model. This means everything you do in your project is written to a Protobuf message, and that message is then serialized to YAML.

By the way, ENIGMA doesn’t use Protobuf’s built-in serialization/deserialization functions. Instead, ENIGMA has its own module for managing all that, called libEGM. This module takes a mutable message and, using Reflection, serializes it to YAML. This is how ENIGMA saves its project files.

So here is what I must do:

I need to integrate the Visual Shader Editor with the model.
I need to change the VisualShader class to become only a Generator.

And this is what Josh was trying to tell me. I covered this in Google Summer of Code 2024 Week 18, 19, and 20: Wrapping Up and Final Evaluation.

Separating my concerns means there is NO NEED TO STORE GUI-RELATED DATA IN MY BACKEND. Anything related to the frontend should be stored in the protobuf model. So don’t do this:

std::string VisualShaderNodeFloatConstant::get_caption() const { return "FloatConstant"; }

int VisualShaderNodeFloatConstant::get_input_port_count() const { return 0; }

VisualShaderNode::PortType VisualShaderNodeFloatConstant::get_input_port_type([[maybe_unused]] const int& port) const {
  return PORT_TYPE_SCALAR;
}

std::string VisualShaderNodeFloatConstant::get_input_port_name([[maybe_unused]] const int& port) const {
  return std::string();
}

int VisualShaderNodeFloatConstant::get_output_port_count() const { return 1; }

VisualShaderNode::PortType VisualShaderNodeFloatConstant::get_output_port_type([[maybe_unused]] const int& port) const {
  return PORT_TYPE_SCALAR;
}

std::string VisualShaderNodeFloatConstant::get_output_port_name([[maybe_unused]] const int& port) const { return ""; }

Instead, do this:

message VisualShaderNodeFloatConstant {
    option (node_caption) = "Float Constant";
    option (node_input_port_count) = 0;
    option (node_input_port_type) = PORT_TYPE_SCALAR;
    option (node_input_port_caption) = "";
    
    option (node_output_port_count) = 1;
    option (node_output_port_type) = PORT_TYPE_SCALAR;
    option (node_output_port_caption) = "";

    optional double value = 1;
}

This means ports, captions, and values should be stored in the protobuf model. This is how ENIGMA does it. This is how I should do it.

This means the VisualShader class should only be a generator. It should not store any data. It should only take a number of nodes and connections and then generate shader code.

After working on RGM for so long, I decided to complete this on a simpler version of RGM, because RGM is really big and my CPU cries on every build. Also, see the pic below? This issue happens too often in RGM’s codebase, and it is very difficult to determine the reason. Why? Well, consider the code snippet below:
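To make the "generator only" idea concrete, here is a minimal sketch of a stateless generator. It is not the real VisualShader class: Node, Connection, generate_shader, the "FloatConstant" and "Output" kinds, and the emitted GLSL are all hypothetical names picked for illustration, and in the real project the nodes and connections would come from the protobuf model.

```cpp
#include <ios>
#include <map>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-ins for data that would be read from the protobuf model.
struct Node {
  std::string kind;  // "FloatConstant" or "Output" in this sketch
  double value;      // only meaningful for "FloatConstant"
};

struct Connection {
  int from_node;  // id of the node whose output is read
  int to_node;    // id of the node whose input is written
};

// Stateless generator: everything it needs arrives through its parameters,
// and nothing is kept between calls.
std::string generate_shader(const std::map<int, Node>& nodes,
                            const std::vector<Connection>& connections) {
  std::ostringstream code;
  code << std::fixed;  // always print a decimal point for float literals
  code << "void main() {\n";

  // Declare one variable per constant node.
  for (const auto& entry : nodes) {
    if (entry.second.kind == "FloatConstant") {
      code << "  float n" << entry.first << " = " << entry.second.value << ";\n";
    }
  }

  // Emit an assignment for every constant that is wired into an output node.
  for (const Connection& c : connections) {
    const auto from = nodes.find(c.from_node);
    const auto to = nodes.find(c.to_node);
    if (from == nodes.end() || to == nodes.end()) continue;  // dangling connection
    if (from->second.kind == "FloatConstant" && to->second.kind == "Output") {
      code << "  gl_FragColor = vec4(n" << c.from_node << ");\n";
    }
  }

  code << "}\n";
  return code.str();
}
```

Calling generate_shader with one "FloatConstant" node, one "Output" node, and a connection between them returns a small main() body as a string. The point of the design is that the function owns no data, so the same model can be saved, loaded, and regenerated without the generator knowing anything about the GUI.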

const google::protobuf::FieldDescriptor* field {some_descriptor->FindFieldByNumber(some_field_number)};
if (field == nullptr) {
    std::cerr << "Field not found: " << field->full_name() << std::endl;
}

Notice the problem? If field is nullptr, then field->full_name() dereferences a null pointer, which should cause a segmentation fault. Instead, the issue in the pic below shows up. I have no reasonable explanation for this, and this is just a simple example. RGM’s codebase is legacy code (in my opinion), with enough complexity to make it difficult to determine the reason for the issue.

[Image: Protobuf Runtime Issue]
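For comparison, here is a sketch of how the null check in the snippet above could log safely. To keep it buildable without protobuf, it uses FakeField and FakeDescriptor, which are hypothetical stand-ins for google::protobuf::FieldDescriptor and google::protobuf::Descriptor. The helper field_name_or_empty is also made up for this example. The only behavior borrowed from protobuf is that FindFieldByNumber returns nullptr when the number is unknown.

```cpp
#include <iostream>
#include <map>
#include <string>

// Hypothetical stand-in for google::protobuf::FieldDescriptor.
struct FakeField {
  std::string full_name;
};

// Hypothetical stand-in for google::protobuf::Descriptor.
struct FakeDescriptor {
  std::string full_name;
  std::map<int, FakeField> fields;

  // Same contract as the real lookup: nullptr when the number is unknown.
  const FakeField* FindFieldByNumber(int number) const {
    const auto it = fields.find(number);
    return it == fields.end() ? nullptr : &it->second;
  }
};

// Returns the field's full name, or an empty string after logging a message
// that is built only from values known to be valid.
std::string field_name_or_empty(const FakeDescriptor& desc, int number) {
  const FakeField* field = desc.FindFieldByNumber(number);
  if (field == nullptr) {
    // Log the number and the owning descriptor, never the null field itself.
    std::cerr << "Field not found: number " << number << " in "
              << desc.full_name << std::endl;
    return std::string();
  }
  return field->full_name;  // safe: field was checked above
}
```

The error message uses the field number and the descriptor name, both of which are still valid when the lookup fails, and the early return keeps the null pointer from being used afterwards.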

After working continuously on my mini-RGM, I can finally guess what RGM’s model needs. I have created a separate model for the oneofs, which helps keep that complexity out of the MessageModel class, because according to Protobuf’s syntax, oneofs are only allowed in message scope.

Of course, RGM’s model is far more complex than mine. RGM’s model employs many, if not all, features of QAbstractItemModel, such as roles, not to mention that it is made for many editors at once (not only one, like mine, heh).

Anyway, I have made it and now my model works fine starting from b17ce5b.

Diagnosing and Resolving Protobuf Pointer Access Crashes in Debug Mode

One of the reasons I left RGM and started working on the project in a clean codebase is that, while debugging RGM, a crash happens when I reach a certain point in the code. Consider the code snippet below: when I step to line 11, the crash happens. I didn’t know why back then, because sometimes it happened and sometimes it didn’t.

 1: void MessageModel::RebuildSubModels() {
 2:   submodels_by_field_.clear();
 3:   submodels_by_row_.clear();
 4:   R_EXPECT_V(_protobuf) << "Internal protobuf null";
 5: 
 6:   const Descriptor *desc = _protobuf->GetDescriptor();
 7:   const Reflection *refl = _protobuf->GetReflection();
 8:   submodels_by_row_.resize(desc->field_count());
 9: 
10:   for (int i = 0; i < desc->field_count(); i++) {
11:     const FieldDescriptor *field = desc->field(i);
12: 
13:     if (field->is_repeated()) {
14:       switch (field->cpp_type()) {
15:         case CppType::CPPTYPE_ENUM: {
16:           qDebug() << "ENUMs not yet handled";
17:           break;
18:         }
19:         case CppType::CPPTYPE_MESSAGE: {
20:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedMessageModel(this, _protobuf, field);
21:           break;
22:         }
23:         case CppType::CPPTYPE_BOOL: {
24:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedBoolModel(this, _protobuf, field);
25:           break;
26:         }
27:         case CppType::CPPTYPE_INT32: {
28:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedInt32Model(this, _protobuf, field);
29:           break;
30:         }
31:         case CppType::CPPTYPE_INT64: {
32:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedInt64Model(this, _protobuf, field);
33:           break;
34:         }
35:         case CppType::CPPTYPE_UINT32: {
36:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedUInt32Model(this, _protobuf, field);
37:           break;
38:         }
39:         case CppType::CPPTYPE_UINT64: {
40:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedUInt64Model(this, _protobuf, field);
41:           break;
42:         }
43:         case CppType::CPPTYPE_FLOAT: {
44:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedFloatModel(this, _protobuf, field);
45:           break;
46:         }
47:         case CppType::CPPTYPE_DOUBLE: {
48:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedDoubleModel(this, _protobuf, field);
49:           break;
50:         }
51:         case CppType::CPPTYPE_STRING: {
52:           submodels_by_field_[field->number()] = submodels_by_row_[i] = new RepeatedStringModel(this, _protobuf, field);
53:           break;
54:         }
55:       }
56:     } else if (field->cpp_type() == CppType::CPPTYPE_MESSAGE) {
57:       // Ignore all unset oneof fields if any is set
58:       if (IsCulledOneof_(refl, *_protobuf, field)) continue;
59:       // Only recursively build fields if they're set
60:       if (refl->HasField(*_protobuf, field)) {
61:         submodels_by_field_[field->number()] = submodels_by_row_[i] =
62:             new MessageModel(this, refl->MutableMessage(_protobuf, field), i);
63:       } else {
64:         submodels_by_field_[field->number()] = submodels_by_row_[i] = new MessageModel(this, field->message_type(), i);
65:       }
66:     } else {
67:       submodels_by_field_[field->number()] = submodels_by_row_[i] = new PrimitiveModel(this, field);
68:     }
69:   }
70: }

The thing is, I noticed the same problem while working with protobuf and MSVC, but this time an access to a valid pointer crashed the program. I know the pointer is valid because the same code works fine with GCC. The problem was that mixing debug and release libraries in MSVC causes the crash. More precisely, it depends on setting CMAKE_MSVC_RUNTIME_LIBRARY correctly.

Given the above, I think the problem is not with RGM but with how I was building protobuf. Anyway, whoever reads this post, check out this shell script for Ubuntu: build_protobuf_ubuntu.sh. You can invoke it like this:
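One way to check which runtime a given target actually ends up with is to look at the macros MSVC predefines for each runtime flag. The sketch below is only a diagnostic idea, not something taken from RGM, and the function name runtime_flavor is made up. It relies on _DEBUG being defined for the debug runtimes (/MDd, /MTd) and _DLL being defined for the DLL runtimes (/MD, /MDd), which are the flags that CMAKE_MSVC_RUNTIME_LIBRARY selects.

```cpp
#include <string>

// Describes the C runtime this translation unit was compiled against.
// On compilers that do not define _MSC_VER it reports "not MSVC".
std::string runtime_flavor() {
#if defined(_MSC_VER)
#  if defined(_DLL) && defined(_DEBUG)
  return "MSVC /MDd (debug, DLL runtime)";
#  elif defined(_DLL)
  return "MSVC /MD (release, DLL runtime)";
#  elif defined(_DEBUG)
  return "MSVC /MTd (debug, static runtime)";
#  else
  return "MSVC /MT (release, static runtime)";
#  endif
#else
  return "not MSVC";
#endif
}
```

Printing this once from the application and once from a small program built with the same settings as protobuf would show whether the two disagree, which is the kind of mismatch described above.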

chmod +x ./build_protobuf_ubuntu.sh
sudo ./build_protobuf_ubuntu.sh Debug Dynamic

or with whatever parameters you want. The invocation above builds protobuf in Debug mode with dynamic linking. I hope this helps.