How do I avoid code duplication in this example?

Question

I have a simple library (let's call it Library #1) that manipulates a data vector (e.g. a time series). I want to make a new library (Library #2) that has most (but not all) of the same functions, but acts on only a single data point in that vector. I imagine a solution that is a thin wrapper around the existing library and minimizes code duplication.

Here is the simple Library #1:

class Foo {
private:
   std::vector<double> data;

public:
   // Some constructors

   double get_data_at_timepoint(int timepoint) const;

   // Some other methods
};

get_data_at_timepoint just returns the appropriate element of the data vector (assuming it exists). The other class in the library, Bar, holds a container of Foo objects and manipulates them in some way. In particular, it can do_something, and you can also get a Foo from it:

class Bar {
private:
    std::vector<Foo> foos;

public:
    // Some constructors

    Foo get_this_foo(int idx) const;

    void do_something();

    // Some other methods
};

where (important) do_something calls get_data_at_timepoint in some way:

void Bar::do_something() {
   // ...
   double x = foos[some_idx].get_data_at_timepoint(some_timepoint);
   // ...
}

Library #2 should be Library #1 restricted to a single point in time. Something like:

class Foo2 {
private:
    double data;

public:
    double get_data() const;

    // All those other methods of Foo
};

class Bar2 {
private:
    std::vector<Foo2> foos;

public:
    Foo2 get_this_foo_2(int idx) const;
    void do_something();
    // All those other methods of Bar
};

where now:

void Bar2::do_something() {
   // ...
   double x = foos[some_idx].get_data();
   // ...
}

Clearly, Foo2 is basically just Foo, but with a single data entry. I could rewrite all of Foo, but then I would have to duplicate all the methods. I want instead to define a thin wrapper of Foo that is of length 1 (a single datapoint).

For Foo2, there are two options: (1) subclass Foo, or (2) make Foo2 a wrapper around a std::unique_ptr to a Foo. I think (2) is better because the user should not have access to e.g. timepoints through the base class Foo.
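
As a rough illustration, option (2) might look like the sketch below. It makes two assumptions that are not in the code above: Foo is given a constructor taking a std::vector<double>, and the wrapped Foo is held by value rather than through a std::unique_ptr (a pointer member would forward in the same way).

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Stand-in for Library #1's Foo; the vector constructor is an assumption.
class Foo {
    std::vector<double> data;
public:
    explicit Foo(std::vector<double> d) : data(std::move(d)) {}
    double get_data_at_timepoint(int timepoint) const {
        return data.at(static_cast<std::size_t>(timepoint));
    }
};

// Thin wrapper: a Foo of length 1 whose time axis is hidden from the user.
class Foo2 {
    Foo impl;  // always holds exactly one data point
public:
    explicit Foo2(double value) : impl(std::vector<double>{value}) {}
    double get_data() const { return impl.get_data_at_timepoint(0); }
    // Other Foo methods would be forwarded to impl in the same way.
};
```

Because Foo is a private member and not a base class, nothing that takes a timepoint is visible on Foo2.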

I also want to avoid writing extra code for Bar. The function do_something of course needs to be adapted slightly in Bar2, but overall these two seem so parallel. A lot of the other methods in Bar are also the same.

How do I avoid code duplication for Foo2 and Bar2?

Anonymous1847 · Accepted Answer

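Move the shared logic into a class template parameterized on the Foo type, and put the one call that differs behind a virtual function: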

template <typename FooT>
class BarBase {
protected:
    std::vector<FooT> foos;  // protected so Bar and Bar2 can reach it
    virtual double get_data_from_foo(unsigned int, void*) = 0;
public:
    virtual ~BarBase() = default;
    void do_something();
    // all other methods that used to be in Bar
};

class Bar : public BarBase<Foo> {
protected:
    // timepoint_t is the time point type (int in the question)
    double get_data_from_foo(unsigned int id, void* time_ptr) override {
        return foos[id].get_data_at_timepoint(*static_cast<timepoint_t*>(time_ptr));
    }
};

class Bar2 : public BarBase<Foo2> {
protected:
    double get_data_from_foo(unsigned int id, void* /*unused*/) override {
        return foos[id].get_data();
    }
};

You’ll have to call get_data_from_foo() inside BarBase::do_something(). You also have to calculate the time point and pass it to that function, regardless of whether it’s needed.
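
To make that concrete, here is a minimal self-contained sketch of the whole pattern. Several things in it are assumptions for illustration only: the constructors, the `int timepoint` parameter on do_something(), and the fact that do_something() returns a sum (a stand-in for whatever the real function computes).

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Minimal stand-ins for the question's classes; constructors are assumptions.
class Foo {
    std::vector<double> data;
public:
    explicit Foo(std::vector<double> d) : data(std::move(d)) {}
    double get_data_at_timepoint(int t) const {
        return data.at(static_cast<std::size_t>(t));
    }
};

class Foo2 {
    double data;
public:
    explicit Foo2(double d) : data(d) {}
    double get_data() const { return data; }
};

template <typename FooT>
class BarBase {
protected:
    std::vector<FooT> foos;
    // The only step that differs between Bar and Bar2.
    virtual double get_data_from_foo(unsigned int id, void* time_ptr) = 0;
public:
    explicit BarBase(std::vector<FooT> f) : foos(std::move(f)) {}
    virtual ~BarBase() = default;

    // Shared algorithm, written once. Summing is a hypothetical stand-in.
    double do_something(int timepoint) {
        double total = 0.0;
        for (unsigned int id = 0; id < foos.size(); ++id) {
            // The time point is always passed, even if the hook ignores it.
            total += get_data_from_foo(id, &timepoint);
        }
        return total;
    }
};

class Bar : public BarBase<Foo> {
public:
    using BarBase<Foo>::BarBase;  // inherit the constructor
protected:
    double get_data_from_foo(unsigned int id, void* time_ptr) override {
        return foos[id].get_data_at_timepoint(*static_cast<int*>(time_ptr));
    }
};

class Bar2 : public BarBase<Foo2> {
public:
    using BarBase<Foo2>::BarBase;
protected:
    double get_data_from_foo(unsigned int id, void* /*unused*/) override {
        return foos[id].get_data();
    }
};
```

The loop in do_something() exists only once, in BarBase; Bar and Bar2 each supply just the one-line hook.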

Alternatively, if you don’t mind the code duplication inside do_something(), remove get_data_from_foo() and add a do_something() member function to each of Bar and Bar2, defining them separately.
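
A sketch of that alternative, under the same caveats: the constructors and the summing body of do_something() are hypothetical stand-ins, and get_this_foo() represents the methods that really are identical in both classes.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Minimal stand-ins for the question's classes; constructors are assumptions.
class Foo {
    std::vector<double> data;
public:
    explicit Foo(std::vector<double> d) : data(std::move(d)) {}
    double get_data_at_timepoint(int t) const {
        return data.at(static_cast<std::size_t>(t));
    }
};

class Foo2 {
    double data;
public:
    explicit Foo2(double d) : data(d) {}
    double get_data() const { return data; }
};

// Shared, non-virtual base: everything except do_something lives here.
template <typename FooT>
class BarBase {
protected:
    std::vector<FooT> foos;
public:
    explicit BarBase(std::vector<FooT> f) : foos(std::move(f)) {}
    FooT get_this_foo(int idx) const {
        return foos.at(static_cast<std::size_t>(idx));
    }
};

class Bar : public BarBase<Foo> {
public:
    using BarBase<Foo>::BarBase;
    // Defined separately in each class; summing is a hypothetical stand-in.
    double do_something(int timepoint) const {
        double total = 0.0;
        for (const Foo& f : foos) total += f.get_data_at_timepoint(timepoint);
        return total;
    }
};

class Bar2 : public BarBase<Foo2> {
public:
    using BarBase<Foo2>::BarBase;
    double do_something() const {
        double total = 0.0;
        for (const Foo2& f : foos) total += f.get_data();
        return total;
    }
};
```

This version needs no virtual call and no void* argument, and Bar2::do_something() does not have to accept a time point it never uses. The cost is that the loop is written twice.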
