Covariance and Contravariance in .NET C#

Have hard time understanding it? Let me simplify it for you.

If it is so hard on you to understand what Covariance and Contravariance in .NET C# means, don’t feel ashamed of it, you are not alone.

It happened to me and many other developers. I even know experienced developers who either don’t know about them and are using them but still can’t understand them well enough.

From where I see it, this is happening because every time I come across an article talking about Covariance and Contravariance, I find it focused on some technical terminologies rather than being concerned about the reason why we have them in the first place and what we would have missed if they didn’t exist.

Microsoft’s Definition

If you check Microsoft’s documentation for the Covariance and Contravariance in .NET C#, you would find this definition:

In C#, covariance and contravariance enable implicit reference conversion for array types, delegate types, and generic type arguments. Covariance preserves assignment compatibility and contravariance reverses it.

Do you get it? do you like it?

You can search the internet and you will find tons of resources about this topic. You will come across definitions, history, when introduced, code samples,… and many others and this is not what you would find in this story. I promise you that what you would see here is different….

What are they actually?

Basically, what Microsoft did is that they added a small addition to the way you define your generic template type place holder, the famous .

What you used to do when defining a generic interface is to follow the pattern public interface IMyInterface<T> {…}. After having Covariance and Contravariance introduced, you can now follow the pattern public interface IMyInterface<out T> {…} or public interface IMyInterface<in T> {…}.

Do you recognize the extra out and in?
Have you seen them somewhere else?
May be on the famous .NET public interface IEnumerable<out T>?
or the famous .NET public interface IComparable<in T>?

Microsoft introduced a new concept so that the compiler -at design time- would make sure that the types of objects you use and pass around generic members would not throw runtime exceptions caused by wrong type expectations.

Still not clear, right? Just bear with me... Let’s assume that the compiler doesn’t apply any design time restrictions and see what would happen.

What if the compiler doesn’t apply any design time restrictions?

To be able to work on an appropriate example, let’s define the following:

	public class A
	{
	public void F1(){}
	}

	public class B : A
	{
	public void F2(){}
	}

	public class C : B
	{
	public void F3(){}
	}

	public interface IReaderWriter<TEntity>
	{
	TEntity Read();
	void Write(TEntity entity);
	}

	public class ReaderWriter<TEntity> : IReaderWriter<TEntity> where TEntity : new()
	{
	public TEntity Read()
	{
	return new TEntity();
	}

	public void Write(TEntity entity)
	{
	}
	}

view raw IReaderWriter.cs hosted with ❤ by GitHub

Class A has F1() defined.

Class B has F1() and F2() defined.

Class C has F1(), F2() , and F3() defined.

The interface IReaderWriter has Read() which returns an object of type TEntity and Write(TEntity entity) which expects a parameter of type TEntity.

Then let’s define a TestReadWriter() method as follows:

	public static void TestReaderWriter(IReaderWriter<B> param)
	{
	var b = param.Read();
	b.F1();
	b.F2();

	param.Write(b);
	}

view raw TestReaderWriter.cs hosted with ❤ by GitHub

Calling TestReadWriter() when passing in an instance of IReaderWriter

This should work fine as we are not violating any rules. TestReadWriter() is already expecting a parameter of type IReaderWriter.

Calling TestReadWriter() when passing in an instance of IReaderWriter<A>

Keeping in mind the assumption that the compiler doesn’t apply any design time restrictions, this means that:

param.Read() would return an instance of class A, not B
=> So, the var b would actually be of type A, not B
=> This would lead to the b.F2() line to fail as the var b -which is actually of type A- does not have F2() defined

param.Write() line in the code above would be expecting to receive a parameter of type A, not B
=> So, calling param.Write() while passing in a parameter of type B would both work fine

Therefore, since in the point #1 we are expecting a runtime failure, then we can’t call TestReadWriter() with passing in an instance of IReaderWriter<A>.

Calling TestReadWriter() when passing in an instance of IReaderWriter<C>

Keeping in mind the assumption that the compiler doesn’t apply any design time restrictions, this means that:

param.Read() would return an instance of class C, not B
=> So, the var b would actually be of type C, not B
=> This would lead to the b.F2() line to work fine as the var b would have F2()

param.Write() line in the code above would be expecting to receive a parameter of type C, not B
=> So, calling param.Write() while passing in a parameter of type B would fail because simply you can’t replace C with its parent B

Therefore, since in the point #2 we are expecting a runtime failure, then we can’t call TestReadWriter() with passing in an instance of IReaderWriter<C>.

Now, let’s analyze what we have discovered up to this moment:

Calling TestReadWriter(IReaderWriter param) when passing in an instance of IReaderWriter is always fine.

Calling TestReadWriter(IReaderWriter param) when passing in an instance of IReaderWriter<A> would be fine if we don’t have the param.Read() call.

Calling TestReadWriter(IReaderWriter param) when passing in an instance of IReaderWriter<C> would be fine if we don’t have the param.Write() call.

However, since we always have a mix between param.Read() and param.Write(), we would always have to stick to calling TestReadWriter(IReaderWriter param) with passing in an instance of IReaderWriter, nothing else.

Unless…….

The Alternative

What if we make sure that the IReaderWriter<TEntity> interface defines either TEntity Read() or void Write(TEntity entity), not both of them at the same time.

Therefore, if we drop the TEntity Read(), we would be able to call TestReadWriter(IReaderWriter param) with passing in an instance of IReaderWriter<A> or IReaderWriter.

Similarly, if we drop the void Write(TEntity entity), we would be able to call TestReadWriter(IReaderWriter param) with passing in an instance of IReaderWriter or IReaderWriter<C>.

This would be better for us as it would be less restrictive, right?

Time for some Facts

In the real world, the compiler -in design time- would never allow calling TestReadWriter(IReaderWriter param) with passing in an instance of IReaderWriter<A>. You would get a compilation error.

Also, the compiler -in design time- would not allow calling TestReadWriter(IReaderWriter param) with passing in an instance of IReaderWriter<C>. You would get a compilation error.

From point #1 and #2, this is called Invariance.

Even if you drop the TEntity Read() from the IReaderWriter<TEntity> interface, the compiler -in design time- would not allow you to call TestReadWriter(IReaderWriter param) with passing in an instance of IReaderWriter<A>. You would get a compilation error. This is because the compiler would not -implicitly by itself- look into the members defined into the interface and see if it is going to always work at runtime or not. You will need to do this by yourself through <in TEntity>. This acts as a promise from you to the compiler that all members in the interface would either don’t depend on TEntity or deal with it as an input, not an output. This is called Contravariance.

Similarly, even if you drop the void Write(TEntity entity) from the IReaderWriter<TEntity> interface, the compiler -in design time- would not allow you to call TestReadWriter(IReaderWriter param) with passing in an instance of IReaderWriter<C>. You would get a compilation error. This is because the compiler would not -implicitly by itself- look into the members defined into the interface and see if it is going to always work at runtime or not. You will need to do this by yourself through <out TEntity>. This acts as a promise from you to the compiler that all members in the interface would either don’t depend on TEntity or deal with it as an output, not an input. This is called Covariance.

Therefore, adding <out > or <in > makes the compiler less restrictive to serve our needs, not more restrictive as some developers would think.

Summary

At this point, you should already understand the full story of Invariance, Covariance and Contravariance. However, as a quick recap, you can deal with the following as a cheat sheet:

Mix between input and output generic type => Invariance => the most restrictive => can’t replace with parents or children.

Added <in > => only input => Contravariance => itself or replace with parents.

Added <out > => only output => Covariance => itself or replace with children.

Finally, I will drop here some code for you to check. It would help you practice more.

	using System;
	using System.Collections.Generic;
	using System.Linq;
	using System.Text;
	using System.Threading.Tasks;

	namespace DotNetVariance
	{
	class Program
	{
	static void Main(string[] args)
	{
	IReader<A> readerA = new Reader<A>();
	IReader<B> readerB = new Reader<B>();
	IReader<C> readerC = new Reader<C>();
	IWriter<A> writerA = new Writer<A>();
	IWriter<B> writerB = new Writer<B>();
	IWriter<C> writerC = new Writer<C>();
	IReaderWriter<A> readerWriterA = new ReaderWriter<A>();
	IReaderWriter<B> readerWriterB = new ReaderWriter<B>();
	IReaderWriter<C> readerWriterC = new ReaderWriter<C>();

	#region Covariance
	// IReader<TEntity> is Covariant, this means that:
	// 1. All members either don't deal with TEntity or have it in the return type, not the input parameters
	// 2. In a call, IReader<TEntity> could be replaced by any IReader<TAnotherEntity> given that TAnotherEntity
	// is a child -directly or indirectly- of TEntity

	// TestReader(readerB) is ok because TestReader is already expecting IReader<B>
	TestReader(readerB);

	// TestReader(readerC) is ok because C is a child of B
	TestReader(readerC);

	// TestReader(readerA) is NOT ok because A is a not a child of B
	TestReader(readerA);
	#endregion

	#region Contravariance
	// IWriter<TEntity> is Contravariant, this means that:
	// 1. All members either don't deal with TEntity or have it in the input parameters, not in the return type
	// 2. In a call, IWriter<TEntity> could be replaced by any IWriter<TAnotherEntity> given that TAnotherEntity
	// is a parent -directly or indirectly- of TEntity

	// TestWriter(writerB) is ok because TestWriter is already expecting IWriter<B>
	TestWriter(writerB);

	// TestWriter(writerA) is ok because A is a parent of B
	TestWriter(writerA);

	// TestWriter(writerC) is NOT ok because C is a not a parent of B
	TestWriter(writerC);
	#endregion

	#region Invariance
	// IReaderWriter<TEntity> is Invariant, this means that:
	// 1. Some members have TEntity in the input parameters and others have TEntity in the return type
	// 2. In a call, IReaderWriter<TEntity> could not be replaced by any IReaderWriter<TAnotherEntity>

	// IReaderWriter(readerWriterB) is ok because TestReaderWriter is already expecting IReaderWriter<B>
	TestReaderWriter(readerWriterB);

	// IReaderWriter(readerWriterA) is NOT ok because IReaderWriter<B> can not be replaced by IReaderWriter<A>
	TestReaderWriter(readerWriterA);

	// IReaderWriter(readerWriterC) is NOT ok because IReaderWriter<B> can not be replaced by IReaderWriter<C>
	TestReaderWriter(readerWriterC);
	#endregion
	}

	public static void TestReader(IReader<B> param)
	{
	var b = param.Read();
	b.F1();
	b.F2();

	// What if the compiler allows calling TestReader with a param of type IReader<A>, This means that:
	// param.Read() would return an instance of class A, not B
	// => So, the var b would actually be of type A, not B
	// => This would lead to the b.F2() line in the code above to fail as the var b doesn't have F2()

	// What if the compiler allows calling TestReader with a param of type IReader<C>, This means that:
	// param.Read() would return an instance of class C, not B
	// => So, the var b would actually be of type C, not B
	// => This would lead to the b.F2() line in the code above to work fine as the var b would have F2()
	}

	public static void TestWriter(IWriter<B> param)
	{
	var b = new B();
	param.Write(b);

	// What if the compiler allows calling TestWriter with a param of type IWriter<A>, This means that:
	// param.Write() line in the code above would be expecting to receive a parameter of type A, not B
	// => So, calling param.Write() while passing in a parameter of type A or B would both work

	// What if the compiler allows calling TestWriter with a param of type IWriter<C>, This means that:
	// param.Write() line in the code above would be expecting to receive a parameter of type C, not B
	// => So, calling param.Write() while passing in a parameter of type B would not work
	}

	public static void TestReaderWriter(IReaderWriter<B> param)
	{
	var b = param.Read();
	b.F1();
	b.F2();

	param.Write(b);

	// What if the compiler allows calling TestReaderWriter with a param of type IReaderWriter<A>, This means that:
	// 1. param.Read() would return an instance of class A, not B
	// => So, the var b would actually be of type A, not B
	// => This would lead to the b.F2() line in the code above to fail as the var b doesn't have F2()
	// 2. param.Write() line in the code above would be expecting to receive a parameter of type A, not B
	// => So, calling param.Write() while passing in a parameter of type A or B would both work

	// What if the compiler allows calling TestReaderWriter with a param of type IReaderWriter<C>, This means that:
	// 1. param.Read() would return an instance of class C, not B
	// => So, the var b would actually be of type C, not B
	// => This would lead to the b.F2() line in the code above to work fine as the var b would have F2()
	// 2. param.Write() line in the code above would be expecting to receive a parameter of type C, not B
	// => So, calling param.Write() while passing in a parameter of type B would not work
	}
	}

	#region Hierarchy Classes
	public class A
	{
	public void F1()
	{
	}
	}

	public class B : A
	{
	public void F2()
	{
	}
	}

	public class C : B
	{
	public void F3()
	{
	}
	}
	#endregion

	#region Covariant IReader
	// IReader<TEntity> is Covariant as all members either don't deal with TEntity or have it in the return type
	// not the input parameters
	public interface IReader<out TEntity>
	{
	TEntity Read();
	}

	public class Reader<TEntity> : IReader<TEntity> where TEntity : new()
	{
	public TEntity Read()
	{
	return new TEntity();
	}
	}
	#endregion

	#region Contravariant IWriter
	// IWriter<TEntity> is Contravariant as all members either don't deal with TEntity or have it in the input parameters
	// not the return type
	public interface IWriter<in TEntity>
	{
	void Write(TEntity entity);
	}

	public class Writer<TEntity> : IWriter<TEntity> where TEntity : new()
	{
	public void Write(TEntity entity)
	{
	}
	}
	#endregion

	#region Invariant IReaderWriter
	// IReaderWriter<TEntity> is Invariant as some members have TEntity in the input parameters
	// and others have TEntity in the return type
	public interface IReaderWriter<TEntity>
	{
	TEntity Read();
	void Write(TEntity entity);
	}

	public class ReaderWriter<TEntity> : IReaderWriter<TEntity> where TEntity : new()
	{
	public TEntity Read()
	{
	return new TEntity();
	}

	public void Write(TEntity entity)
	{
	}
	}
	#endregion
	}