I have Table1 with the following relationships (they are not enforced they only create the relationship for the navigation properties)
Table1 (*)->(1) Table2 Tabl
How to force Entity Framework to do Inner Joins if you have a table structure such that:
When you want to look up Students who have passed particular Tests, you would logically do something like:
var studentsWhoPassed = context.Set<StudentEntity>()
.Where(x => x.Something)
.Include(x => x.Schedules.Select(y => y.Classes.Select(z => z.Tests)))
.Etc().Etc()
The point being that you start with a StudentEntity and put in some conditions based on the joins down the chain. But because a Student to Schedule is optional, E.F. generates LEFT OUTER Joins.
Instead, you should start lower down the chain and build up. For example:
var studentsWhoPassed = context.Set<ClassEntity>()
.Where(class => class.Tests.Any(test => test.Status == Status.Passed)
&& class.Schedule.Student.Something == studentSomething)
.Include(class => class.Schedule.Student)
It is weird to start with a Class when you are trying to query for Students with Test Criteria. But it actually makes the LINQ simpler.
Because of the Student does not have to have a Schedule, but... a Class has to have Test(s), and a Class must have a ScheduleID, and Schedules must have a StudentID, you get Inner Joins all the way around.
Granted, this school example is abstract, but the idea holds true to other examples I have worked with the same types of relationships.
in EF when doing IQueryable.Include()
if none of the navigation properties are based on an enforced relationship then EF will use the first table. It expects that at least one of the relationships is enforced in the schema and that one should be coded with the IQueryable.Include()
first, then add the other tables with Include()
EF seems to use INNER JOIN
for including a required and LEFT OUTER JOIN
for including an optional navigation property. Example:
public class Order
{
public int Id { get; set; }
public string Details { get; set; }
public Customer Customer { get; set; }
}
public class Customer
{
public int Id { get; set; }
public string Name { get; set; }
}
If I define Customer
as a required property on Order
...
public class MyContext : DbContext
{
public DbSet<Order> Orders { get; set; }
protected override void OnModelCreating(DbModelBuilder modelBuilder)
{
modelBuilder.Entity<Order>()
.HasRequired(o => o.Customer)
.WithMany();
}
}
...and issue this query...
using (var ctx = new MyContext())
{
var result = ctx.Orders
.Include(o => o.Customer)
.Where(o => o.Details == "Peanuts")
.FirstOrDefault();
}
...I get this SQL:
SELECT TOP (1)
[Extent1].[Id] AS [Id],
[Extent1].[Details] AS [Details],
[Extent2].[Id] AS [Id1],
[Extent2].[Name] AS [Name]
FROM [dbo].[Orders] AS [Extent1]
INNER JOIN [dbo].[Customers] AS [Extent2]
ON [Extent1].[Customer_Id] = [Extent2].[Id]
WHERE N'Peanuts' = [Extent1].[Details]
If I change in the model configuration .HasRequired(o => o.Customer)
to...
.HasOptional(o => o.Customer)
... I get exactly the same query except that INNER JOIN [dbo].[Customers] AS [Extent2]
is replaced by:
LEFT OUTER JOIN [dbo].[Customers] AS [Extent2]
From model viewpoint it makes sense because you are saying that there can never be an Order
without a Customer
if you define the relationship as required. If you circumvent this requirement by removing the enforcement in the database and if you actually have then orders without a customer you violate your own model definition.
Only solution is likely to make the relationship optional if you have that situation. I don't think it is possible to control the SQL that is created when you use Include
.