Starting with Objects

A general rule of good program design (not always true, but worth considering) is that you should identify the objects in the real world, and make sure that each has a proper representation in the program. In the Goldfish Club (discussed presiously), the real world objects are people, goldfish, the person-goldfish relationship, and the club's membership list.
        People and Goldfish are solid unarguable objects, but some might argue about the relationship between a goldfish and a person. We are not concerned philosophical arguments about what makes an object. For us, an object is anything, concrete or abstract, that has some kind of existence and is of importance to our program. The connection between members of the Goldfish Club and their goldfish certainly qualifies.
        So we have four kinds of real world objects. None of them correspond to software objects in our program. Each person is certainly represented in the arrays, but there is no one thing that can be indentified as the representation of a person. We could say that the 73rd person in the club's records is represented by the collection of ten items <memnum[73], title[73], fname[73], lname[73], straddr[73], city[73], state[73], zip[73], phone[73], birthday[73]>, but that is not a single object, and it isn't even true anyway. If one person owns three goldfish, they will appear in the list three times, so their "representation" is spread over a number of ten-item groups. Imagine having to change a member's name (when he or she gets married or joins a new religion); there is no one place where the change could be recorded, and that is very strong evidence that there is no one object representing the person.
        All the other kinds of objects are in the same position. The information that is required for the object is present, but not in the form of a proper object. The main thing that C++ and other object oriented languages give us, is the ability to define proper objects.

        In C++, we could define new kinds of objects to represent people and goldfish quite easily. This is the syntax:
class Person
{ public:
    char *memnum;
    char *title;
    char *fname;
    char *lname;
    char *straddr;
    char *city;
    char *state;
    char *zip;
    char *phone;
    char *birthday; };
class Goldfish
{ public:
    char *name;
    char *species;
    char *birthday;
    char *birthyear;
    char *favcolour; };




    
The introduction: "class Person" simply says that we are defining a new class. Class is the C++ term for a "kind of object". The class called Person will be the representation for all persons, not just one. After these declarations, Person and Goldfish become new types, just like int and float, and we will be able to define variables that contain Persons just as we can define variables that contain integers: "int x;" and "Person x;" are both valid declarations, and mean more-or-less the same thing.
        The curly brackets are required to surround the parts that make up the class, otherwise they would look like normal declarations of global variables. The word "public" indicates that all of the parts about to be defined are generally accessible. It is possible to protect parts against accidental (or deliberate) changes by saying protected or private instead, but that will come later. The rest of the definition of the class is simply a list of normal variable declarations. They could have any types, and any names. In this example they are all char * because that's how they were in the previous example. It would be much more sensible to make some of them (such as memnum, birthday, and birthyear) into ints instead, and soon we will.
        Those class definitions do not define any variables at all. They do not create variables called title or birthday or any of the others. They simply specify that any time you create a variable of type Person, it will contain within it, sub-variables called title, fname, and so on. This little example shows what that means:
          Person a, b;
          a.title="Mrs";
          a.fname="Semolina";
          a.lname="Pudding";
          a.straddr="221b Baker St";
          a.city="Origami";
          a.state="MT";
          a.zip="83741";
          a.phone="2839981243";
          a.birthday="1225";     // i.e. 25th December
          b=a;
          b.title="Mr";
          b.fname="Poindexter";
          b.birthday="1031";
The only variables created in this example are a and b. Each of them have the nine defined parts that can be accessed individually as variables in their own right, but the object as a whole also has a real existence, as illustrated by the assignment b=a; which copies all of a's parts into b.
        It is absolutely not a problem that both Person and Goldfish have sub-variables called birthday. We are only allowed to access a subvariable in a context that makes the whole object obvious (e.g. in b.birthday="1031"; it is completely obvious that we are talking about b's birthday and nobody else's), so no confusion can ever occur.
        We can also have arrays of objects:
          Person people[100];
          people[1]=a;
          people[2]=b;
An object in an array is no different from an object in a variable of its own. We can access its sub-variables, assign it, or do anything else:
          for (i=1; i<=N; i+=1)
            printf("Person %d is %s %s\n", i, people[i].title, people[i].lname);
We can even have pointers to objects. In fact, objects are nearly always accessed through pointers, for reasons that will become clear later.
          Person *p;
          p=&a;
The normal syntax for accessing a sub-variable through a pointer is quite ugly. First we have to apply * to the pointer to find the object that it refers to, then we have to apply . to the object to reach the correct sub-variable:
          (*p).zip="65432";
so C gives us a combined notation for accessing a sub-variable through a pointer:
          p->zip="65432";
-> is an operator that combines *'s pointer-following and .'s subvariable-accessing into one.

Correct Terminology:
        
An Object is a single self-contained lump of data, whose real existence is totally internal to a program. a, b, and p in the above examples, are not objects. a and b are variables that contain objects (it is a very subtle distinction, but significant nonetheless); p is a variable that points to an object.
        A Class is a data type that represents all possible objects of a given kind. The relationship between Object and Class is the same as the relationship between the number 3 and the type int.
        A Member is what I have been informally calling a sub-variable. It is simply one of the named individual parts that make up an object or a class.


Now that we have good solid software objects to represent at least two of our real world objects, we would be sensible to start designing some functions to make programming with them more convenient. There are certain operations that are definitely going to be performed very frequently. For example, Printing a person's or a goldfish's details is probably going to be a very common task, so we would probably want to write:
          void print(Person p)
          { printf("%s %s %s of %s\n", p.title, p.fname, p.lname, p.city); }

          void print(Goldfish g)
          { printf("%s, a %s born is %s\n", g.name, g.species, g.birthyear); }
And we would instantly see that there is going to be some competition for the most popular function names. They can't both be called print (actually in C++ they can, but that is beside the point). C++ solves this problem by allowing functions to be members of classes, just like variables. A function that is a member of a class is properly called a Method of that class. To illustrate this, we'll redefine the two classes:
class Person
{ public:
    char *memnum;
    char *title;
    char *fname;
    char *lname;
    char *straddr;
    char *city;
    char *state;
    char *zip;
    char *phone;
    char *birthday;
    void print(void); };
class Goldfish
{ public:
    char *name;
    char *species;
    char *birthday;
    char *birthyear;
    char *favcolour;
    void print(void); };





   
void Person::print(void)
{ printf("%s %s %s of %s\n",
       title, fname, lname, city); }
void Goldfish::print(void)
{ printf("%s, a %s born is %s\n",
       name, species, birthyear); }
Whenever you want to put a method in a class, just put a prototype for the function inside the class definition, and a proper complete definition somewhere outside. The prototype "void print(void);" inside Person's definition is simply a promise that Person's own version of print will be defined at some point in the future. When that point in the future is reached, and we are ready to define the function, we have to make it clear which version of print is being defined. That is why the Person:: appeared in front of the name.
        In fact, Person:: isn't in front of the name, it is really part of the name. All members (and methods) of a class can only be accessed in a context that makes it clear exactly which object they are members of. Remember we have to say "a.fname"; just "fname" would not be allowed. Similarly, to print something, we aren't allowed to just say "print()", we must make it clear whose print method is being used: "a.print()". Whenever there is no object handy to make the distinction clear, we can use the member's full name instead; fname is just an abbreviation for the full name Person::fname.
        A method can only be called when it is clear which object's method is meant. That means that a method will only ever run when it is known which object it belongs to. That means that there is no need specify an object again inside a method. It is ok to say fname and lname alone.
        Here's how it works:
          Person her;
          her.title="Miss";
          her.fname="Frogalina";
          her.lname="Sprocket";
          her.city="Ouagadougou";
          her.print();
The statement her.print() means call Person::print, and give it the object her to work on, so inside print, any access to fname is automatically considered to be an access to her.fname, and everything works nicely.
        Methods are always called in connection with a particular object, and they always have direct access to that object's members. That is the real point of a class method.


        Another important concern is that all members of an object should be properly initialised before they are used. Not all members necessarily need initialisation, but some do. Suppose we failed to assign a string to her.city before calling her.print(). The function would have no way of knowing that anything is amiss, and it would attempt to print city as a string. Anything could happen, an uninitialised string is just a random pointer somewhere in memory.
        C++ gives us a way to ensure that all important members are initialised automatically every time a new object is created. This is done by defining a Constructor. A constructor is just a normal method or function with two special attributes. One is that it must have exactly the same name as the class that it belongs to; the other is that it must not have a return type, not even void. To add constructors to our two classes, we would add simple prototypes to the class definitions. Goldfish has this line added to it:
              Goldfish(void);
and Person has this line added to it:
              Person(void);
Then outside the class definitions, we provide the complete definitions for the two constructors. Remember that all we have to do is ensure that all the members are put in safe initial states; we don't need to put any useful information anywhere:
          Goldfish::Goldfish(void)
          { name=""; species="";
            birthday=""; birthyear="";
            favcolour=""; }

          Person::Person(void)
          { memnum="0"; title="Mr"; fname="No"; lname="Name";
            birthday=""; straddr=""; city=""; state=""; zip="";
            phone=""; }
These are very boring functions. The point is that they make programs safe. Every time an object is created, no matter how, its constructor is called. There is nothing you can do to prevent the constructor from being called. In fact, you are not allowed to create objects if their class hasn't got a constructor to call. So if we say:
          Person him;
          him.title="Prof";
          him.fname="Wally";
          him.lname="Drab";
          him.print();
even though we forgot to give a value to city, nothing would go wrong, because the constructor has already given it a default safe value.

        It is a bit wasteful to do all those assignments in the constructor if we haven't forgotten anything, because the very next statements will be assigning new values to those same variables. We could define an alternative constructor that is provided with parameters to tell it useful initial values to give to the members.
        This is how it would look:
          Person::Person(char *mn, char *tt, char *fn, char *ln)
          { memnum=mn; title=tt; fname=fn; lname=ln;
            birthday=""; straddr=""; city=""; state=""; zip="";
            phone=""; }
This constructor has four parameters, all strings. It can only be called if four string parameters are provided. That's no problem. When this sort of constructor is in use, declarations look like this:
          Person a("1231","Ms","Henrietta","Chicken");
          Person b("1232","Mr","Ivan","Itch");
The declarations look like function calls, which is exactly what they are. Variable declarations that look like this work just like any other variable declaration; they just allow parameters to be passed to the constructor.
        Surprisingly, we can have both kinds of constructor, and even more. So long as all the constructors for any class can be distinguished by their parameters (either by having a different number of parameters, or by having the same number but with different types) a class can have as many different constructors as you like,
        For an example, I'll give complete definitions for Person and Goldfish, showing exactly how they would appear in a program. We'll have three constructors for a Person, one to be used when we have no useful information yet, one for when we already know the person's name, and one to be used when we know their address too.
          class Person
          { public:
              char *memnum, *title, *fname, *lname;
              char *straddr, *city, *state, *zip;
              char *phone, *birthday;
              void print(void);
              void print_address_label(void);
              Person(void);
              Person(char *mn, char *tt, char *fn, char *ln);
              Person(char *mn, char *tt, char *fn, char *ln, char *sa,
                     char *ci, char *st, char *zi, char *ph, char *bi); };

          class Goldfish
          { public:
              char *name, *species, *birthday, *birthyear, *favcolour;
              void print(void);
              Goldfish(void);
              Goldfish(char *nm);
              Goldfish(char *nm, char *sp, char *bd, char *by, char *fc); };

          void Person::print(void)
          { printf("%s %s %s of %s\n", title, fname, lname, city); }

          void Person::print_address_label(void)
          { printf("%s %c %s\n", title, fname[0], lname);
            printf("%s\n", straddr);
            printf("%s, %s %s\n", city, state, zip); }

          Person::Person(void)
          { memnum="0"; title=""; fname="No"; lname="Name";
            straddr=""; city=""; state=""; zip="";
            phone=""; birthday=""; }

          Person::Person(char *mn, char *tt, char *fn, char *ln)
          { memnum=mn; title=tt; fname=fn; lname=ln;
            straddr=""; city=""; state=""; zip="";
            phone=""; birthday=""; }

          Person(char *mn, char *tt, char *fn, char *ln, char *sa,
                 char *ci, char *st, char *zi, char *ph, char *bi)
          { memnum=mn; title=tt; fname=fn; lname=ln;
            straddr=sa; city=ci; state=st; zip=zi;
            phone=ph; birthday=bi; }

          void Goldfish::print(void)
          { printf("%s, a %s, born in %s\n", name, species, birthyear); }

          Goldfish::Goldfish(void)
          { name=""; species="";
            birthday=""; birthyear=""; favcolour=""; }

          Goldfish::Goldfish(char *nm)
          { name=nm; species="";
            birthday=""; birthyear=""; favcolour=""; }

          Goldfish(char *nm, char *sp, char *bd, char *by, char *fc)
          { name=nm; species=sp;
            birthday=bd; birthyear=by; favcolour=fc; }
With these definitions in force, a program could correctly use all of the following. In every case, it can tell which constructor to call by the type of the variable being created and the number and types of the parameters provided:
          Person one("8239","Dr","Vera","Crooked");
          Person two("12312","Miss","Jellybean","Jones","11 A Av","Aack","AK","12345","1234567");
          Person three;
          Person four("9239","Uncle","Horace","Horse");
          Goldfish x("Goldie");
          Goldfish y("Yellowie");
          Goldfish z("Fluffy","Himalayan Blue","0231","1865","Purple");
          Person *p;
          p=&two;
One final observation about the last two lines. Constructors are called only when an object is created. The declaration Person *p creates a pointer that can point to an object, but it does not create an object. It is exactly the same as with all other pointer declarations. Declaring a pointer never creates an object, so it never causes a constructor to be called, so you are never allowed to provide parameters. The very last statement makes p point to an object that has already been brough into existence (seven lines before), so of course no constructor is called.

You can see a C++ implementation here.

This is just the beginning. If things don't seem quite right, that may be because you haven't seen everything yet. Don't worry about it. This is quite enough new stuff for one week; we'll get to the rest soon enough.