Full Text Searching XMLs (Lucene.NET Version)

There was a question in a spanish DBMS list, because a guy was trying to use the SQL Server Full Text Search for indexing any object in the application. The objects were being serialized as XML objects and indexed using the FTS engine, but it started to became complex when trying to perform the query.

Anyway, this post is not about SQL Server, but Lucene.NET. So, I started playing again with that and I end up with this: Download Source Code. I guess this test describes all the power of the mini-framework built over Lucene:

public void MixObjects()
{
    var obj = new Foo()
        .SetDescription("hello world")
        .AddBar("text containing the word happy")
        .AddBar("some comments the word programming");

    var obj2 = new Bar("some comments");

    Engine.IndexObject(obj, 5);
    Engine.IndexObject(obj2, 5);

    Engine.DumpIndexInfo();

    Assert.Equal(1, Engine.LookForAll()
                        .Where(@"Foo\Bars\Bar\Comments", "happy")
                        .Search().ResultCount);

    Assert.Equal(1, Engine.LookForAll()
                        .Where(@"Bar\Comments", "comments")
                        .Search().ResultCount);

    Assert.Equal(0, Engine.LookForAll()
                        .Where(@"Bar\Comments", "unexistent text")
                        .Search().ResultCount);

    Assert.Equal(1, Engine.LookFor<Foo>()
                        .FreeWhere("comments")
                        .Search().ResultCount);

    Assert.Equal(1, Engine.LookFor<Bar>()
                        .FreeWhere("comments")
                        .Search().ResultCount);

    Assert.Equal(2, Engine.LookForAll()
                        .FreeWhere("comments")
                        .Search().ResultCount);
}

this.Blog

Search This Blog

Full Text Searching XMLs (Lucene.NET Version)

Labels

Comments

Post a Comment

Popular posts from this blog

Making Celery 4 work in Django 1.10 and Elastic Beanstalk

Stripping HTML from text in SQL Server–Version 3

Deploying PIL/Pillow to AWS Lambda (Serverless)