proto: performance degradation with v1.1.0 #624
Heya -- Is there any way I could help you with this? 😃

Could you add a LICENSE file?

Alternatively, could you run both benchmarks with

@LMMilewski ✔️ Added a LICENSE.
It seems that the problem is with processing maps (see profiles below). Your proto contains a large, deeply nested map (the "node" key in https://github.com/srenatus/pb-bench/blob/master/data/converge-success-report.json is a google.protobuf.Struct, which is a map that can contain nested maps: https://github.com/golang/protobuf/blob/master/ptypes/struct/struct.proto#L52-L55). Map processing is expected to be slow: that path was considered rare, so it uses reflection extensively. That was true in 1.0.0 too, but that version didn't reflectively iterate over maps to calculate their sizes (which is what shows up in the profile). Also, calculating size is at the top of the v1.1.0 profile, accounting for 65.36%. Perhaps it is quadratic in some case (probably maps), which would happen if the size is not cached. v1.0.0 top
v1.1.0 top
@cherryyz Any idea if we can make Size faster in this case?
FWIW, I created a small benchmark (code below) with nested maps (depth 1, 2, 4, 8, 16). Every map has a single key, and the value is either a nested map or a number (the leaf). I got these results:
So it grows faster than it should. The web graph shows that size is driven by map and oneof. I'm not sure how to attach an SVG here, though. Code:
Yeah, map is the rare slow case. It all uses reflection, and it allocates a lot. I just tried to write a specialized version just for string-message maps. There is a good speedup, but in practice we would need NxM key-value type combinations. Given that maps are considered rare, I'm not sure we want to go this way. While writing it, I did find a quadratic behavior in the reflection-based code. We call
I guess it depends on the speedup you can get, but (without pushing for/against the specialization) I want to note that string->message is most likely the only case where "maps are rare" doesn't hold. It's popular because the protobuf spec provides a way to represent arbitrary JSON as protos. That leads to arbitrary string->anything maps ("anything" is represented as a message with a single oneof). Apparently it's used even in Google Cloud APIs (I don't remember which ones, though). That's also what the OP's benchmark does.
Here's a crude histogram of map types across a large corpus:
Since this was done purely by parsing, I couldn't tell if
This is comparing a specialized string-message map (new) vs. the current implementation with my CL above applied (old). I haven't implemented deterministic marshal; in the deterministic case the speedup would be smaller, as we need to allocate a slice of keys and sort it. Specialization only works with the unsafe-based implementation; we can't specialize arbitrary message types in pure reflection code. We could specialize only string-message maps, especially if this is documented as "special" in some way (I just don't want to draw the line too arbitrarily). Or we could specialize all string-T maps (if there is a good reason).
If the value of a map is a message with nested maps, calling valSizer in marshal may be quadratic -- it will call size at each level. Fix by using the cached size if the value type is message. Benchmark results with @LMMilewski's test case from #624:

name   old time/op    new time/op    delta
1-12   1.96µs ±19%    1.76µs ±18%      ~     (p=0.075 n=10+10)
2-12   5.10µs ± 6%    3.84µs ± 8%   -24.69%  (p=0.000 n=9+10)
4-12   14.2µs ±12%     8.2µs ±15%   -42.77%  (p=0.000 n=10+10)
8-12   43.0µs ±10%    15.6µs ±11%   -63.81%  (p=0.000 n=9+10)
16-12   138µs ±12%      33µs ±17%   -76.32%  (p=0.000 n=10+10)

name   old alloc/op   new alloc/op   delta
1-12    376B ± 0%      376B ± 0%       ~     (all equal)
2-12    928B ± 0%      752B ± 0%    -18.97%  (p=0.000 n=10+10)
4-12   2.54kB ± 0%    1.49kB ± 0%   -41.51%  (p=0.000 n=10+10)
8-12   7.89kB ± 0%    2.96kB ± 0%   -62.47%  (p=0.000 n=10+10)
16-12  27.0kB ± 0%     5.9kB ± 0%   -78.11%  (p=0.000 n=10+10)

name   old allocs/op  new allocs/op  delta
1-12    12.0 ± 0%      12.0 ± 0%       ~     (all equal)
2-12    28.0 ± 0%      23.0 ± 0%    -17.86%  (p=0.000 n=10+10)
4-12    75.0 ± 0%      45.0 ± 0%    -40.00%  (p=0.000 n=10+10)
8-12     229 ± 0%        89 ± 0%    -61.14%  (p=0.000 n=10+10)
16-12    777 ± 0%       177 ± 0%    -77.22%  (p=0.000 n=10+10)

Fixes #624.
Thanks for addressing this. I've tried to re-run my aforementioned benchmark with the latest revision (1325a0), but it now panics at runtime:

$ make -f ../Makefile bench
go test -v -count=10 -benchmem -bench .
panic: reflect.Set: value of type map[string]*structpb.Value is not assignable to type map[string]*structpb.Value
goroutine 49 [running]:
reflect.Value.assignTo(0x11e8c60, 0xc420092d50, 0x15, 0x1226774, 0xb, 0x11e8d20, 0x0, 0x100728d, 0x11e8c60, 0xc420092d50)
/usr/local/Cellar/go/1.10/libexec/src/reflect/value.go:2235 +0x427
reflect.Value.Set(0x11e8d20, 0xc420092cf0, 0x195, 0x11e8c60, 0xc420092d50, 0x15)
/usr/local/Cellar/go/1.10/libexec/src/reflect/value.go:1373 +0xa4
github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb.(*Unmarshaler).unmarshalValue(0xc4200aa020, 0x12056e0, 0xc420092cf0, 0x199, 0xc4201f0000, 0x63, 0x70, 0xc4201c7740, 0x1119abb, 0xc4200aa8a0)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb/jsonpb.go:822 +0x2461
github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb.(*Unmarshaler).unmarshalValue(0xc4200aa020, 0x120cfc0, 0xc4201ee030, 0x196, 0xc4201f0000, 0x63, 0x70, 0xc4201c7740, 0x11d60f4, 0x38)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb/jsonpb.go:713 +0x2a6
github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb.(*Unmarshaler).unmarshalValue(0xc4200aa020, 0x121b860, 0xc4201ee000, 0x199, 0xc4201e2000, 0x1a7, 0x1c0, 0xc4201c7440, 0x1001d98, 0xc4201de578)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb/jsonpb.go:934 +0x7d3
github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb.(*Unmarshaler).unmarshalValue(0xc4200aa020, 0x12197c0, 0xc4200a6910, 0x196, 0xc4201e2000, 0x1a7, 0x1c0, 0xc4201c7440, 0x0, 0x0)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb/jsonpb.go:713 +0x2a6
github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb.(*Unmarshaler).unmarshalValue(0xc4200aa020, 0x11d03a0, 0xc4200fecd0, 0x197, 0xc4201ba000, 0x15ac, 0x1800, 0xc4201c7440, 0x11e1e09, 0x41)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb/jsonpb.go:994 +0x1d54
github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb.(*Unmarshaler).unmarshalValue(0xc4200aa020, 0x1220660, 0xc4200fec00, 0x199, 0xc42017c000, 0x19565, 0x1a000, 0x0, 0x1215460, 0x1050f01)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb/jsonpb.go:934 +0x7d3
github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb.(*Unmarshaler).UnmarshalNext(0xc4200aa020, 0xc4200b8000, 0x124b880, 0xc4200fec00, 0x19501, 0xc4200aa080)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb/jsonpb.go:663 +0x13d
github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb.(*Unmarshaler).Unmarshal(0xc4200aa020, 0x124ad00, 0xc4200aa080, 0x124b880, 0xc4200fec00, 0x19566, 0x0)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/vendor/github.com/golang/protobuf/jsonpb/jsonpb.go:674 +0x7c
github.com/srenatus/pb-bench/r-1325a0.getRunMsg(0xc4200fe900, 0x102e8bd)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/benchmark_test.go:111 +0x1ea
github.com/srenatus/pb-bench/r-1325a0.BenchmarkRunMsgMarshal(0xc4200fe900)
/Users/stephan/go/src/github.com/srenatus/pb-bench/r-1325a0/benchmark_test.go:25 +0x32
testing.(*B).runN(0xc4200fe900, 0x1)
/usr/local/Cellar/go/1.10/libexec/src/testing/benchmark.go:141 +0xb2
testing.(*B).run1.func1(0xc4200fe900)
/usr/local/Cellar/go/1.10/libexec/src/testing/benchmark.go:214 +0x5a
created by testing.(*B).run1
/usr/local/Cellar/go/1.10/libexec/src/testing/benchmark.go:207 +0x80
exit status 2
FAIL github.com/srenatus/pb-bench/r-1325a0 0.015s
make: *** [bench] Error 1

This was running on this branch. Am I missing something?
Oh, this kind of error. I've run into this sort of thing before. Typically, the problem is that the two types, despite appearing to come from the same package, are actually imported from different packages (often due to weird interactions with vendored code).
@puellanivis Thanks, good point. While this line seems relevant, the qualified import there was merely for the sake of clarity (since the path didn't match the package name in it)... Also, after changing that line to remove the "rename", the error persists -- it's got to be somewhere else 🤔
Found the problem: you are still including the API from the
Oh dear -- I'm an idiot. Must have had too little coffee back then. Thank you so much 😄

Thanks again @puellanivis ✅ this has resolved my issue, sorry for the noise.
Hi there -- updating our dependency on golang/protobuf to v1.1.0, I've noticed an unexpected dip in performance when running some microbenchmarks.

👉 I've put up a reproduction case here: https://github.com/srenatus/pb-bench -- you can also look at the output on Travis.

The gist is that marshaling (of a rather big message) is getting a lot worse in all three metrics, ranging from +165.77% (alloc/op) to +751.93% (allocs/op).

I've brought this up in #protobuf of the Gopher Slack, where @dsnet asked for further input -- here it is (👋 @dsnet 😃).