The articles following the current one will introduce more kinds of Go types. To easily and deeply understand those articles, it is best to read the following content in the current article firstly before reading those articles.
Go can be viewed as a C-family language, which can be confirmed from the two previous articles pointers in Go and structs in Go. Struct types and pointer types in Go and C are much alike.
On the other hand, Go can be also viewed as a C language framework. This is mainly reflected from the fact that Go supports several kinds of types whose value memory structures are not totally transparent, whereas the main characteristic of C types is the memory structures of C values are transparent. Each C value in memory occupies one continuous memory segment. However, values of some kinds of Go types may often consist of more than one continuous memory segments.
Each continuous memory segment hosts a value, more speaking precisely, a value part. The following picture depicts a multi-part value. Its direct part is referencing its indirect underlying part.
The following two paragraphs describe two categories of Go types.
The kinds of types in the first category are all C alike. These types include boolean types, numeric types, struct types, pointer types, array types and unsafe pointer types. Array types will be explained in the next article and unsafe pointer types will be explained in the article type-unsafe pointers later. Each C alike value only consists of one direct transparent value part.
Other kinds of types belong to the second category. Values of types in the category may contain underlying parts, and their direct parts are not transparent. Specifically speaking, these kinds of types includeThe kinds of types in the second category bring much convenience to Go programming by encapsulating many implementation details. Different Go compilers may adopt different internal implementations for these types, but the external behaviors of values of these types must satisfy the requirements specified in Go specification.
These types will be explained detailedly in the following articles.
Note,The types in the second category are not very fundamental types for a language, we can implement them from scratch by using the types from the first category. However, by encapsulating some common or unique functionalities and supporting these types as the first-class citizens in Go, the experiences of Go programming become enjoyable and productive.
On the other hand, these encapsulations adopted in implementing the types in the second category hide many internal definitions of these types. This prevents Go programmers from viewing the whole pictures of these types, and sometimes makes some obstacles to understanding Go better.
To help gophers better understand the types in the second category and their values, the following content of this article will introduce the internal structure definitions of these kinds of types. The detailed implementations of these types will not be explained here. The explanations in this article are based on, but not exactly the same as, the implementations used by the standard Go compiler.
Before showing the internal structure definitions of the kinds of types in the second category, let's clarify more on pointers and references.
We have learned Go pointers in the article before the last.
The pointer types in that article are type-safe pointer types.
In fact, Go also supports type-unsafe pointer types.
The unsafe.Pointer
type provided in the unsafe
standard package is like void*
in C language.
Types whose underlying types are unsafe.Pointer
are called unsafe pointer types in Go 101.
In most other articles in Go 101, if not specially specified, when a pointer type is mentioned, it means a type-safe pointer type. However, in the following parts of the current article, when a pointer type is mentioned, it may be either a type-safe pointer type or a type-unsafe pointer type.
A pointer value stores a memory address of another value, unless the pointer value is a nil pointer. We can say the pointer value references the other value, or the other value is referenced by the pointer value. Values can also be referenced indirectly.a
has a pointer field b
which references a value c
, then we can say the struct
value a
also references value c
.
x
references (either directly or indirectly)
a value y
, and the value y
references
(either directly or indirectly) a value z
,
then we can also say the value x
(indirectly)
references value z
.
Below, we call a struct type with fields of pointer types as a pointer wrapper type, and call a type whose values may contains (either directly or indirectly) pointers a pointer holder type. Pointer types and pointer wrapper types are all pointer holder types. Array types with pointer holder element types are also pointer holder types. (Array types will be explained in the next article.)
The possible internal definitions of the types in the second category are shown below.
// map types
type _map *hashtableImpl
// channel types
type _channel *channelImpl
// function types
type _function *functionImpl
So, internally, types of the three kinds are just pointer types. In other words, the direct parts of values of these types are pointers internally. For each non-zero value of these types, its direct part (a pointer) references its indirect underlying implementation part.
BTW, the standard Go compiler uses hashtables to implement maps.
type _slice struct {
// referencing underlying elements
elements unsafe.Pointer
// number of elements and capacity
len, cap int
}
So, internally, slice types are pointer wrapper struct types.
Each non-zero slice value has an indirect underlying part
which stores the element values of the slice value.
The elements
field of the direct part references
the indirect underlying part of the slice value.
type _string struct {
elements *byte // referencing underlying bytes
len int // number of bytes
}
So string types are also pointer wrapper struct types internally.
Each string value has an indirect underlying part storing
the bytes of the string value, the indirect part is referenced
by the elements
field of that string value.
type _interface struct {
dynamicType *_type // the dynamic type
dynamicValue unsafe.Pointer // the dynamic value
}
Internally, interface types are also pointer wrapper struct types.
The internal definition of an interface type has two pointer fields.
Each non-zero interface value has two indirect underlying parts
which store the dynamic type and dynamic value of that interface value.
The two indirect parts are referenced by the dynamicType
and dynamicValue
fields of that interface value.
type _interface struct {
dynamicTypeInfo *struct {
dynamicType *_type // the dynamic type
methods []*_function // method table
}
dynamicValue unsafe.Pointer // the dynamic value
}
The methods
field of the dynamicTypeInfo
field
of an interface value stores the implemented methods of the dynamic type
of the interface value for the (interface) type of the interface value.
Now we have learned that the internal definitions of the types in the second category are either pointer types or struct types. Certainly, Go compilers will never view values of the types in the second category as pointers or structs in user programs. These definitions are just used internally in implementing Go runtimes.
In Go, each value assignment (including parameter passing, etc) is a shallow value copy if the involved destination and source values have the same type (if their types are different, we can think that the source value will be implicitly converted to the destination type before doing that assignment). In other words, only the direct part of the source value is copied to the destination value in an value assignment. If the source value has underlying value part(s), then the direct parts of the destination and source values will reference the same underlying value part(s), in other words, the destination and source values will share the same underlying value part(s).
In fact, the above descriptions are not 100% correct in theory, for strings and interfaces. The official Go FAQ says the underlying dynamic value part of an interface value should be copied as well when the interface value is copied. However, as the dynamic value of an interface value is read only, the standard Go compiler/runtime doesn't copy the underlying dynamic value parts in copying interface values. This can be viewed as a compiler optimization. The same situation is for string values and the same optimization (made by the standard Go compiler/runtime) is made for copying string values. So, for the standard Go compiler/runtime, the descriptions in the last section are 100% correct, for values of any type.
Since an indirect underlying part may not belong to any value exclusively,
it doesn't contribute to the size returned by the unsafe.Sizeof
function.
I don't mean the reference type or reference value terminologies are totally useless for Go, I just think they are not very essential, and they bring many confusions in using Go. If we do need these terminologies, I prefer to define them as pointer holders. And, my personal opinion is it is best to limit the reference word to only representing relations between values by using it as a verb or a noun, and never use it as an adjective. This will avoid many confusions in leaning, teaching and using Go.