Factories, handles, and handle wrappers [C++]

Prev: [ANN] CLI 1.1.0 released
Next: ISO C++ forbids initialization in array new

From: Michael Mol on 17 Dec 2009 15:03

{ Note: It looks like some part of the discussion had been cut off from
clc++m and now is being brought back here. -mod }

On Dec 17, 2:53 pm, Maxim Yegorushkin <maxim.yegorush...(a)gmail.com>
wrote:
> On 16/12/09 23:30, Michael Mol wrote:
>
>
>
> > On Dec 16, 4:50 pm, Maxim Yegorushkin<maxim.yegorush...(a)gmail.com>
> > wrote:
> >> On 16/12/09 13:50, Michael Mol wrote:
>
> >> In other words, you've got:
>
> >> 1) A factory that creates objects.
> >> 2) Those objects implement an interface, which is currently belongs to
> >> factory class.
> >> 3) You'd also like for the factory to check whether the object reference
> >> is valid.
>
> > The code exists, and it's been in the field for over a year. What I'd
> > *like* is an abstracted pass-through to work with existing code.
>
> >> You can refactor this to simply things.
>
> >> 1) Extract object interface from the factory.
>
> >> struct SomeObject {
> >> // former SomeFactory::OperateOnObject
> >> virtual ERRCODE Operate(int someArgument) = 0;
> >> virtual ~SomeObject() = 0;
> >> };
>
> >> Using such object now does not require a factory object, i.e. you can
> >> call Operare() directly on the object.
>
> > Which is the type of apparent behavior I'd like. However, the factory
> > object can't be removed entirely, as it also manages the relationship
> > of object instances to each other and the system resources they happen
> > to extract. (A bit of internal behavior I hoped wasn't necessary to
> > describe in order to ask about intuitive of dereferencing syntax.)
>
> >> 2) Make factory return smart-pointers to SomeObject. The objects it
> >> creates implement SomeObject interface.
>
> >> typedef boost::shared_ptr<SomeObject> SomeObjectPtr;
>
> >> class SomeFactory {
> >> public:
> >> SomeObjectPtr createSomeObject();
> >> ...
> >> };
>
> >> Now the factory function returns a smart-pointer. This smart-pointer
> >> takes care of destroying the object when it is no longer used. No manual
> >> object destruction required.
>
> > The object represents an abstraction of a system resource, and there
> > are a lot of operating factors that weigh in on when it's appropriate
> > to free that system resource. It normally doesn't happen during the
> > weeks-long run of an application, though it does on occasion.
>
> >> 3) Using a smart-pointer makes the possibility of using an already
> >> destroyed object highly unlikely. Checking whether the object reference
> >> is valid may be not necessary any more.
>
> >> New usage:
>
> >> SomeFactory factory;
>
> >> // later in some function or scope
> >> {
> >> SomeObjectPtr object = factory.createSomeObject();
> >> object->Operate(123);
> >> } // now object goes out of scope and gets destroyed automatically
>
> >> This looks to be simpler and more intuitive, isn't it?
>
> > Because of the pervasive and high-traffic use of the subsystem
> > OBJHANDLE is part of, and because of the nature of the application,
> > "highly unlikely" is inevitability, and minimizing risk while meeting
> > the client's feature desires close to their scheduling desires is the
> > order of the day. Given the choice between such a large-scale
> > refactoring and dealing with a new formulation of the code or staying
> > with a tedious syntax, I'd stay with the tedious syntax.
>
> > I understand what you're saying, and for new code design, that would
> > be fine. However, the current code exists, has been fielded for over a
> > year, is a core and frequently-trafficked component where that traffic
> > is fairly sensitive--and it's stable.
>
> > That's why my original fielded question laid out two options; Either
> > can be implemented with few risks to stability, and I was looking for
> > a discussion regarding of wrapping a handle such that it may be
> > operated with a more terse syntax.
>
> I see now.
>
> One possible improvement is to wrap the object handle and a reference to
> a factory in a class, so that its instances can be used as standalone
> objects in the new code that can take advantage of the scoped resource
> management. This can well be the interface shown above.
>
> This way you could have two interfaces to the component -- the existing
> interface and the new one. This allows to gradually evolve to using the
> new interface without breaking the existing code.
>
> The key goal here is to take advantage of RAII / scoped resource
> management. Otherwise you may well be better off using plain C.

If taking advantage of RAII would be beneficial in this case, I'd
definitely look at that. As it is, these particular resources are
usually allocated on application initialization, freed and reallocated
during a runtime system-wide reconfiguration (if such an event
occurs), and are normally release on application shutdown. The
individual instances are continually used during the application's run
(with the exception of during that reconfiguration event).

Even without the runtime configuration, scoped awareness for the
object that OBJHANDLE provides access to isn't beneficial, because the
scope in question would be main()...which is already hidden due to
other frameworks in play.

--
[ See http://www.gotw.ca/resources/clcm.htm for info about ]
[ comp.lang.c++.moderated. First time posters: Do this! ]

From: Michael Mol on 18 Dec 2009 07:31

On Dec 17, 7:52 pm, James Kanze <james.ka...(a)gmail.com> wrote:
> On Dec 16, 1:50 pm, Michael Mol <mike...(a)gmail.com> wrote:
>
> > Let's say you have a factory which returns handles to objects that it
> > allocates. You pass these handles to the factory whenever you want to
> > use the object or destroy it. The factory's "OperateOnObject" class
> > verifies that the handle is within its object collection, and then
> > marshals the OperateOnObject call to the class's internal
> > OperateOnObject method. All external operations on the object
> > instance are required to pass through the Factory in order to
> > guarantee that the object instance is still valid at the time of call.
> > This leads to code roughly like this. (Not using exceptions here, but
> > rather return error codes.)
>
> This is overly complicated; most importantly, it renders client code
> unnecessarily complicated. The simple solution for this is:
>
> 1. Install the Boehm garbage collector and use it for these objects.
>
> 2. Define a flag in each object, which is set in the constructor, and
> reset in the destructor.
>
> 3. Check this flag each time you use the object.

While I'll admit that the current code has flaws in this area, how
does setting a class member "AmIValid" flag guard against
dereferencing an uninitizialized or wild pointer? In either case, I
have an instant crash. (In most such circumstances, the current code
triggers a hard breakpoint in debug, and a null-op return in release.
The only remaining circumstance hasn't been seen, but is presumably
possible.)

More fundamentally, how does it guard against a stale pointer that
winds up pointing to memory overwritten due to some other heap
allocation? Any non-zero value in that flag would indicate a valid
instance, as far as the code that peeks at that flag is concerned.

(If some of the context is missing, I'm sory; Some of the relevant
replies in comp.lang.c++ don't have comp.lang.c++.moderated in the To
or Followup-To fields.)

--
[ See http://www.gotw.ca/resources/clcm.htm for info about ]
[ comp.lang.c++.moderated. First time posters: Do this! ]

From: James Kanze on 18 Dec 2009 21:45

On Dec 19, 12:31 am, Michael Mol <mike...(a)gmail.com> wrote:
> On Dec 17, 7:52 pm, James Kanze <james.ka...(a)gmail.com> wrote:
> > On Dec 16, 1:50 pm, Michael Mol <mike...(a)gmail.com> wrote:

> > > Let's say you have a factory which returns handles to
> > > objects that it allocates. You pass these handles to the
> > > factory whenever you want to use the object or destroy it.
> > > The factory's "OperateOnObject" class verifies that the
> > > handle is within its object collection, and then marshals
> > > the OperateOnObject call to the class's internal
> > > OperateOnObject method. All external operations on the
> > > object instance are required to pass through the Factory
> > > in order to guarantee that the object instance is still
> > > valid at the time of call. This leads to code roughly
> > > like this. (Not using exceptions here, but rather return
> > > error codes.)

> > This is overly complicated; most importantly, it renders
> > client code unnecessarily complicated. The simple solution
> > for this is:

> > 1. Install the Boehm garbage collector and use it for these objects.

> > 2. Define a flag in each object, which is set in the constructor, and
> > reset in the destructor.

> > 3. Check this flag each time you use the object.

> While I'll admit that the current code has flaws in this area,
> how does setting a class member "AmIValid" flag guard against
> dereferencing an uninitizialized or wild pointer? In either
> case, I have an instant crash. (In most such circumstances,
> the current code triggers a hard breakpoint in debug, and a
> null-op return in release. The only remaining circumstance
> hasn't been seen, but is presumably possible.)

It can't do much with regards to a wild pointer (initialized to
some random value). On the other hand, in the above scenario,
the destructor of an object will reset the flag, and garbage
collection will ensure that the underlying memory is not reused.
If all of the member functions check the flag, and if there are
no public data members, all use of a destructed object can be
detected.

> More fundamentally, how does it guard against a stale pointer
> that winds up pointing to memory overwritten due to some other
> heap allocation?

That's why you need garbage collection, point 1 above. After
installing the Boehm collector, you provide a global operator
new function which calls gc_malloc, and a global operator delete
function which does nothing. As long as there is a pointer to
the memory (and sometimes longer), it cannot be reused.

--
James Kanze

--
[ See http://www.gotw.ca/resources/clcm.htm for info about ]
[ comp.lang.c++.moderated. First time posters: Do this! ]

From: Maxim Yegorushkin on 18 Dec 2009 21:43

On 18/12/09 05:47, Michael Mol wrote:
> On Dec 17, 1:19 pm, James Kanze<james.ka...(a)gmail.com> wrote:
>> On Dec 16, 9:50 pm, Maxim Yegorushkin<maxim.yegorush...(a)gmail.com>
>> wrote:
>>> On 16/12/09 13:50, Michael Mol wrote:
> [snip]
>
>>
>> Except that boost::smart_ptr won't necessarily work here---it will
>> render destruction non-deterministic, and will cause objects to "leak"
>> as soon as there are any cycles.
>>
>> Of course, his solution won't work either, since without garbage
>> collection, there's absolutely no way to ensure that the invalid
>> pointer
>> remains invalid.
>
> While the identifier space is currently sparse enough that that hasn't
> been detected as a problem (normally, fewer than twenty allocations
> occur within the lifetime of the application), that's an interesting
> point. I originally sought to deal with it in the beginning by using
> sequential integer values, but settled back on the raw pointer value
> to deal with resulting code complexity.

That sequential counter would need to be embedded both in the object and
in the handle. Comparing these two integers detects whether the object
being referred to is still the same object. This method, however, does
not protect from accessing an object which exists no more.

> I suppose UUIDs would work,
> but would be overkill. A 64-bit sequential integer could work as well;
> The run time of the application, and the frequency of allocations,
> would mean there would never be an overflow.

To reduce the possibility of overflow that counter only needs to be
incremented on an allocation following a deallocation, since several
allocations in a row with no intervening deallocation always return
unique pointers.

--
Max

[ See http://www.gotw.ca/resources/clcm.htm for info about ]
[ comp.lang.c++.moderated. First time posters: Do this! ]

From: Michael Mol on 19 Dec 2009 00:12

On Dec 19, 9:43 am, Maxim Yegorushkin <maxim.yegorush...(a)gmail.com>
wrote:
> On 18/12/09 05:47, Michael Mol wrote:
>
>
>
> > On Dec 17, 1:19 pm, James Kanze<james.ka...(a)gmail.com> wrote:
> >> On Dec 16, 9:50 pm, Maxim Yegorushkin<maxim.yegorush...(a)gmail.com>
> >> wrote:
> >>> On 16/12/09 13:50, Michael Mol wrote:
> > [snip]
>
> >> Except that boost::smart_ptr won't necessarily work here---it will
> >> render destruction non-deterministic, and will cause objects to "leak"
> >> as soon as there are any cycles.
>
> >> Of course, his solution won't work either, since without garbage
> >> collection, there's absolutely no way to ensure that the invalid
> >> pointer
> >> remains invalid.
>
> > While the identifier space is currently sparse enough that that hasn't
> > been detected as a problem (normally, fewer than twenty allocations
> > occur within the lifetime of the application), that's an interesting
> > point. I originally sought to deal with it in the beginning by using
> > sequential integer values, but settled back on the raw pointer value
> > to deal with resulting code complexity.
>
> That sequential counter would need to be embedded both in the object and
> in the handle. Comparing these two integers detects whether the object
> being referred to is still the same object. This method, however, does
> not protect from accessing an object which exists no more.

Having a simple map in the Factory solves the handle->pointer
conversion. (That's actually how I wrote the first draft.) There's
overhead finding the element in the map, but there's overhead in
finding the handle in the existing OBJCOLLECTION.

>
> > I suppose UUIDs would work,
> > but would be overkill. A 64-bit sequential integer could work as well;
> > The run time of the application, and the frequency of allocations,
> > would mean there would never be an overflow.
>
> To reduce the possibility of overflow that counter only needs to be
> incremented on an allocation following a deallocation, since several
> allocations in a row with no intervening deallocation always return
> unique pointers.

So what you're suggesting is creating a UID that's essentially pointer
+incrementor. An interesting idea. In this particular scenario, I
could get away with a 16-bit (or even an 8-bit) incrementor.

--
[ See http://www.gotw.ca/resources/clcm.htm for info about ]
[ comp.lang.c++.moderated. First time posters: Do this! ]

First | Prev | Next | Last
Pages: 1 2 3 4
Prev: [ANN] CLI 1.1.0 released
Next: ISO C++ forbids initialization in array new