Elasticsearch multi-field and conditional sorting: Painless script implementation guide-javaTutorial-php.cn

Table of Contents

scene description

Solution: Painless script sorting

1. Index mapping and data preparation

2. Implement script sorting

3. Interpretation of results

Things to note and best practices

Summarize

Home

Java

javaTutorial

Elasticsearch multi-field and conditional sorting: Painless script implementation guide

Mary-Kate Olsen

Jan 01, 2026 am 09:33 AM

Elasticsearch multi-field and conditional sorting: Painless script implementation guide

This article deeply explores the implementation method of multi-field sorting based on complex business logic in Elasticsearch. Through the Painless script, we show how to flexibly adjust the sort order of the `createdAt` field according to the presence or absence of the `tags` field in the document, that is, those with tags are sorted in ascending order of `createdAt`, and those without tags are sorted in descending order of `createdAt`, thereby meeting advanced customized sorting requirements.

In Elasticsearch, conventional field sorting is usually to directly arrange one or more fields in ascending order (asc) or descending order (desc). However, when business requirements involve more complex conditional logic, such as determining the sorting direction of a field based on the presence or absence of another field, the standard sorting mechanism cannot directly satisfy it. At this time, Elasticsearch's script-based sorting function is particularly powerful and flexible.

scene description

Suppose we have the following document structure, including the createdAt date field and tags keyword array field:

 doc1:
{
    "createdAt": "2022-11-25T09:45:00.000Z",
    "tags": [
      "Response Needed"
    ]
}
doc2:
{
    "createdAt": "2022-11-24T09:45:00.000Z",
    "tags": [
      "Customer care","Response Needed"
    ]
}
doc3:
{
    "createdAt": "2022-11-24T09:45:00.000Z",
    "tags": [

    ]
}

Our sorting requirements are:

First, sort according to the presence or absence of the tags field: documents with tags are ranked first, documents without tags are ranked last.
Secondly, for documents with tags, sort them in ascending order of the createdAt field.
Finally, for documents without tags, sort them in descending order of the createdAt field.

Solution: Painless script sorting

In order to implement the above complex conditional sorting, we will use Painless script to dynamically calculate the sorting value during the sorting stage.

1. Index mapping and data preparation

First, we need to create an index and define the field mapping, especially createdAt as date type and tags as keyword type so that the script can access and process it correctly.

 PUT idx_conditional_sort
{
  "mappings": {
    "properties": {
      "createdAt": {
        "type": "date"
      },
      "tags": {
        "type": "keyword"
      }
    }
  }
}

Next, we insert some sample documents for testing:

 POST idx_conditional_sort/_doc
{
    "createdAt": "2022-11-25T09:45:00.000Z",
    "tags": [
      "Response Needed"
    ]
}

POST idx_conditional_sort/_doc
{
    "createdAt": "2022-11-24T09:45:00.000Z",
    "tags": [
      "Response 02"
    ]
}

POST idx_conditional_sort/_doc
{
    "createdAt": "2022-11-24T09:45:00.000Z",
    "tags": [
      "Customer care","Response Needed"
    ]
}

POST idx_conditional_sort/_doc
{
    "createdAt": "2022-11-26T09:45:00.000Z",
    "tags": []
}

POST idx_conditional_sort/_doc
{
    "createdAt": "2022-11-23T09:45:00.000Z",
    "tags": []
}

2. Implement script sorting

We will use two scripts to sort elements in an array to implement multi-stage and conditional sorting.

Sorting logic decomposition:

First-level sorting (script 1): used to distinguish documents with tags and documents without tags.
- If the tags array is not empty, the script returns a higher value (eg 1).
- If the tags array is empty, the script returns a lower value (eg 0).
- Sort the results of this script in descending order (desc), ensuring that documents with tags take precedence.
Second-level sorting (script 2): used to conditionally sort createdAt based on tags status.
- If the tags array is not empty: directly return the millisecond timestamp of createdAt, so that the ascending order of createdAt will be achieved when the overall ascending order is arranged.
- If the tags array is empty: return the negative value of createdAt millisecond timestamp, so that when the overall ascending order is arranged, the descending order of createdAt will be achieved (because the larger the negative value, the earlier the actual time).

Complete query statement:

 GET idx_conditional_sort/_search
{
  "sort": [
    {
      "_script": {
        "type": "number",
        "script": {
          "lang": "painless",
          "source": """
            // Script 1: Group according to whether tags exist // Return 1 for tags, return 0 for no tags return doc['tags.keyword'].size() &gt; 0 ? 1 : 0;
          """
        },
        "order": "desc" // Sort in descending order, ensuring that tagged documents are at the front}
    },
    {
      "_script": {
        "type": "number",
        "script": {
          "lang": "painless",
          "source": """
            // Script 2: Conditionally sort createdAt based on tags status long createdAtMillis = doc['createdAt'].value.toInstant().toEpochMilli();
            if (doc['tags.keyword'].size() &gt; 0) {
              // If there are tags, sort them in ascending order by createdAt and return the timestamp directly return createdAtMillis;
            } else {
              // If there is no label, sort by createdAt in descending order and return the negative value of the timestamp // In this way, when sorting in overall ascending order, the greater the negative value (the smaller the absolute value), the later the actual time is, achieving the descending order effect return -createdAtMillis;
            }
          """
        },
        "order": "asc" // Arrange in ascending order, and use the internal logic of the script to implement conditional sorting}
    }
  ]
}

3. Interpretation of results

After executing the above query, Elasticsearch will return the following results (partially):

 {
  "took": ...,
  "timed_out": false,
  "_shards": ...,
  "hits": {
    "total": { "value": 5, "relation": "eq" },
    "max_score": null,
    "hits": [
      {
        "_index": "idx_conditional_sort",
        "_id": "t42r-oQBEoAIompjS0Xh",
        "_score": null,
        "_source": {
          "createdAt": "2022-11-24T09:45:00.000Z",
          "tags": [ "Response 02" ]
        },
        "sort": [ 1, 1669283100000 ] // Tag exists, createdAt ascending order},
      {
        "_index": "idx_conditional_sort",
        "_id": "uI2r-oQBEoAIompjS0Xj",
        "_score": null,
        "_source": {
          "createdAt": "2022-11-24T09:45:00.000Z",
          "tags": [ "Customer care", "Response Needed" ]
        },
        "sort": [ 1, 1669283100000 ] // Tag exists, createdAt ascending order},
      {
        "_index": "idx_conditional_sort",
        "_id": "sY2r-oQBEoAIompjS0Xg",
        "_score": null,
        "_source": {
          "createdAt": "2022-11-25T09:45:00.000Z",
          "tags": [ "Response Needed" ]
        },
        "sort": [ 1, 1669369500000 ] // Tag exists, createdAt ascending order},
      {
        "_index": "idx_conditional_sort",
        "_id": "uY2r-oQBEoAIompjS0Xk",
        "_score": null,
        "_source": {
          "createdAt": "2022-11-26T09:45:00.000Z",
          "tags": []
        },
        "sort": [ 0, -1669455900000 ] // Tag does not exist, createdAt descending order (negative value ascending order)
      },
      {
        "_index": "idx_conditional_sort",
        "_id": "uo2r-oQBEoAIompjS0Xl",
        "_score": null,
        "_source": {
          "createdAt": "2022-11-23T09:45:00.000Z",
          "tags": []
        },
        "sort": [ 0, -1669196700000 ] // Tag does not exist, createdAt descending order (negative value ascending order)
      }
    ]
  }
}

As you can see from the hits array, documents are first grouped according to the presence of the tags field (the first element of the sort array is 1 for tags and 0 for no tags).

Documents with tags (the first element of the sort array is 1) are sorted first, and they are sorted internally in ascending order according to the timestamp of createdAt (1669283100000 before 1669369500000).
Untagged documents (the first element of the sort array is 0) are sorted at the back, and they are arranged internally in descending order according to the timestamp of createdAt (-1669455900000 corresponds to 2022-11-26, -1669196700000 corresponds to 2022-11-23. Due to negative values in ascending order, -1669455900000 is smaller than -1669196700000, so the documents on 2022-11-26 are ranked before the documents on 2022-11-23, achieving descending order).

Things to note and best practices

Performance considerations: Script sorting is generally more expensive than field sorting because it requires executing the script on each document. For large datasets, this can lead to performance degradation. If possible, avoid complex script sorting through index design or the use of runtime fields.
Script caching: Elasticsearch caches compiled scripts to reduce the cost of repeated execution. Therefore, the performance will be better when the same script is executed multiple times.
Field types: Make sure fields accessed in the script have the correct type. For example, doc['tags.keyword'] accesses the tags field of keyword type. For a date field, doc['createdAt'].value.toInstant().toEpochMilli() is a common way to get its millisecond timestamp.
Painless syntax: Painless is a safe and efficient scripting language dedicated to Elasticsearch. Familiarity with its syntax and API is key to writing effective scripts.
Alternatives: In some simple scenarios, you can consider using missing parameters or nested fields to handle missing values or sorting of array fields, but for the complex conditional logic in this example, script sorting is currently the most direct and powerful method.

Summarize

Through Painless script sorting, Elasticsearch provides extremely high flexibility to handle complex sorting requirements based on conditional judgments. Although script sorting will bring a certain performance overhead, it is an indispensable and powerful tool in scenarios where business logic cannot be implemented through standard field sorting. Understanding its working principles and best practices can help developers build search functions that meet diverse needs.

The above is the detailed content of Elasticsearch multi-field and conditional sorting: Painless script implementation guide. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undress AI Tool

Undress images for free

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undresser.AI Undress

AI-powered app for creating realistic nude photos

ArtGPT

AI image generator for creative art from text prompts.

Stock Market GPT

AI powered investment research for smarter decisions

Hot Article

The Notepad upgrade, cheaper YouTube TV, and Nova Launcher's new owner: News roundup

4 weeks ago By DDD

How to apply the facade pattern (Facade) in Golang Go language simplifies the API of complex systems

4 weeks ago By DDD

Solve the error of multidict build failure when installing Python package

1 months ago By DDD

Catch a Monster Best Mounts and Locations

4 weeks ago By DDD

How to configure Spark distributed computing environment in Java_Java big data processing

4 weeks ago By DDD

Popular tool

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Douyin level price list 1-75

20521

wifi shows no ip assigned

13634

Virtual mobile phone number to receive verification code

11967

Where is the login entrance for gmail email?

9008

How to turn off windows security center

8506

Related knowledge

How to configure Spark distributed computing environment in Java_Java big data processing Mar 09, 2026 pm 08:45 PM

Spark cannot run in local mode, ClassNotFoundException: org.apache.spark.sql.SparkSession. This is the most common first step of getting stuck: even the dependencies are not correct. Only spark-core_2.12 is written in Maven, but spark-sql_2.12 is not added. SparkSession crashes as soon as it is built. The Scala version must strictly match the official Spark compiled version - Spark3.4.x uses Scala2.12 by default. If you use spark-sqljar of 2.13, the class loader cannot directly find the main class. Practical advice: Go to mvnre

How to safely map user-entered weekday string to integer value and implement date offset operation in Java Mar 09, 2026 pm 09:43 PM

This article introduces a concise and maintainable way to map the weekday string (such as "Monday") to the corresponding serial number (1-7), and use the modulo operation to realize the forward and backward offset of any number of days (such as Monday plus 4 days to get Friday), avoiding lengthy if chains and hard-coded logic.

How to use Homebrew to install Java on Mac_A must-have Java tool chain for developers Mar 09, 2026 pm 09:48 PM

Homebrew installs the latest stable version of openjdk (such as JDK22) by default, not the LTS version; you need to explicitly execute brewinstallopenjdk@17 or brewinstallopenjdk@21 to install the LTS version, and manually configure PATH and JAVA_HOME to be correctly recognized by the system and IDE.

What is exception masking (Suppressed Exceptions) in Java_Multiple resource shutdown exception handling Mar 10, 2026 pm 06:57 PM

What is SuppressedException: It is not "swallowed", but actively archived by the JVM. SuppressedException is not an exception loss, but the JVM quietly attaches the secondary exception to the main exception under the premise that "only one exception must be thrown" for you to verify afterwards. It is automatically triggered by the JVM in only two scenarios: one is that the resource closure in try-with-resources fails, and the other is that you manually call addSuppressed() in finally. The key difference is: the former is fully automatic and safe; the latter requires you to keep it to yourself, and it can be written as shadowing if you are not careful. try-

How to correctly implement runtime file writing in Java applications (avoiding JAR internal write failures) Mar 09, 2026 pm 07:57 PM

After a Java application is packaged as a JAR, data cannot be written directly to the resources in the JAR package (such as test.txt) because the JAR is essentially a read-only ZIP archive; the correct approach is to write variable data to an external path (such as a user directory, a temporary directory, or a configuration-specified path).

What is the underlying principle of array expansion in Java_Java memory dynamic adjustment analysis Mar 09, 2026 pm 09:45 PM

ArrayList.add() triggers expansion because grow() is called when size is equal to elementData.length. The first add allocates 10 capacity, and subsequent expansion is 1.5 times and not less than the minimum requirement, relying on delayed initialization and System.arraycopy optimization.

Complete tutorial on reading data from file and initializing two-dimensional array in Java Mar 09, 2026 pm 09:18 PM

This article explains in detail how to load an integer sequence in an external text file into a Java two-dimensional array according to a specified row and column structure (such as 2500×100), avoiding manual assignment or index out-of-bounds, and ensuring accurate data order and robust and reusable code.

A concise method in Java to compare whether four byte values are equal and non-zero Mar 09, 2026 pm 09:40 PM

This article introduces several professional solutions for efficiently and safely comparing multiple byte type return values (such as getPlayer()) in Java to see if they are all equal and non-zero. We recommend two methods, StreamAPI and logical expansion, to avoid Boolean and byte mis-comparison errors.