-
Notifications
You must be signed in to change notification settings - Fork 123
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Signed-off-by: Alexei Karikov <[email protected]>
- Loading branch information
Showing
1 changed file
with
166 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,166 @@ | ||
# Bulk | ||
|
||
In this guide, you'll learn how to use the OpenSearch JavaScript Client API to perform bulk operations. You'll learn how to index, update, and delete multiple documents in a single request. | ||
|
||
## Setup | ||
First, create a client instance with the following code: | ||
|
||
```javascript | ||
const { Client } = require('@opensearch-project/opensearch'); | ||
|
||
const client = new Client({ | ||
node: 'http://localhost:9200', | ||
}); | ||
``` | ||
|
||
Next, create an index named `movies` and another named `books` with the default settings: | ||
|
||
```javascript | ||
const movies = 'movies'; | ||
const books = 'books'; | ||
|
||
client.indices.create({ index: movies }).then(() => { | ||
console.log(`Index ${movies} created`); | ||
}); | ||
|
||
client.indices.create({ index: books }).then(() => { | ||
console.log(`Index ${books} created`); | ||
}); | ||
``` | ||
|
||
|
||
## Bulk API | ||
|
||
The `bulk` API action allows you to perform document operations in a single request. The body of the request is an array of objects that contains the bulk operations and the target documents to index, create, update, or delete. | ||
|
||
### Indexing multiple documents | ||
The following code creates two documents in the `movies` index and one document in the `books` index: | ||
|
||
```javascript | ||
client.bulk({ | ||
body: [ | ||
{ index: { _index: movies, _id: 1 } }, | ||
{ title: 'Beauty and the Beast', year: 1991 }, | ||
{ index: { _index: movies, _id: 2 } }, | ||
{ title: 'Beauty and the Beast - Live Action', year: 2017 }, | ||
{ index: { _index: books, _id: 1 } }, | ||
{ title: 'The Lion King', year: 1994 } | ||
] | ||
}).then((response) => { | ||
console.log(response); | ||
}); | ||
``` | ||
As you can see, each bulk operation is comprised of two objects. The first object contains the operation type and the target document's `_index` and `_id`. The second object contains the document's data. As a result, the body of the request above contains six objects for three index actions. | ||
|
||
Alternatively, the `bulk` method can accept an array of hashes where each hash represents a single operation. The following code is equivalent to the previous example: | ||
|
||
```javascript | ||
client.bulk({ | ||
body: [ | ||
{ index: { _index: movies, _id: 1, data: { title: 'Beauty and the Beast', year: 1991 } } }, | ||
{ index: { _index: movies, _id: 2, data: { title: 'Beauty and the Beast - Live Action', year: 2017 } } }, | ||
{ index: { _index: books, _id: 1, data: { title: 'The Lion King', year: 1994 } } } | ||
] | ||
}).then((response) => { | ||
console.log(response); | ||
}); | ||
``` | ||
|
||
We will use this format for the rest of the examples in this guide. | ||
|
||
### Creating multiple documents | ||
|
||
Similarly, instead of calling the `create` method for each document, you can use the `bulk` API to create multiple documents in a single request. The following code creates three documents in the `movies` index and one in the `books` index: | ||
|
||
```javascript | ||
client.bulk({ | ||
index: movies, | ||
body: [ | ||
{ create: { data: { title: 'Beauty and the Beast 2', year: 2030 } } }, | ||
{ create: { data: { title: 'Beauty and the Beast 3', year: 2031 } } }, | ||
{ create: { data: { title: 'Beauty and the Beast 4', year: 2049 } } }, | ||
{ create: { _index: books, data: { title: 'The Lion King 2', year: 1998 } } } | ||
] | ||
}).then((response) => { | ||
console.log(response); | ||
}); | ||
``` | ||
Note that we specified only the `_index` for the last document in the request body. This is because the `bulk` method accepts an `index` parameter that specifies the default `_index` for all bulk operations in the request body. Moreover, we omit the `_id` for each document and let OpenSearch generate them for us in this example, just like we can with the `create` method. | ||
|
||
### Updating multiple documents | ||
```javascript | ||
client.bulk({ | ||
index: movies, | ||
body: [ | ||
{ update: { _id: 1, data: { doc: { year: 1992 } } } }, | ||
{ update: { _id: 2, data: { doc: { year: 2018 } } } } | ||
] | ||
}).then((response) => { | ||
console.log(response); | ||
}); | ||
``` | ||
Note that the updated data is specified in the `doc` field of the `data` object. | ||
|
||
|
||
### Deleting multiple documents | ||
```javascript | ||
client.bulk({ | ||
index: movies, | ||
body: [ | ||
{ delete: { _id: 1 } }, | ||
{ delete: { _id: 2 } } | ||
] | ||
}).then((response) => { | ||
console.log(response); | ||
}); | ||
``` | ||
|
||
### Mix and match operations | ||
You can mix and match the different operations in a single request. The following code creates two documents, updates one document, and deletes another document: | ||
|
||
```javascript | ||
client.bulk({ | ||
index: movies, | ||
body: [ | ||
{ create: { data: { title: 'Beauty and the Beast 5', year: 2050 } } }, | ||
{ create: { data: { title: 'Beauty and the Beast 6', year: 2051 } } }, | ||
{ update: { _id: 3, data: { doc: { year: 2052 } } } }, | ||
{ delete: { _id: 4 } } | ||
] | ||
}).then((response) => { | ||
console.log(response); | ||
}); | ||
``` | ||
|
||
### Handling errors | ||
The `bulk` API returns an array of responses for each operation in the request body. Each response contains a `status` field that indicates whether the operation was successful or not. If the operation was successful, the `status` field is set to a `2xx` code. Otherwise, the response contains an error message in the `error` field. | ||
|
||
The following code shows how to look for errors in the response: | ||
|
||
```javascript | ||
client.bulk({ | ||
index: movies, | ||
body: [ | ||
{ create: { _id: 1, data: { title: 'Beauty and the Beast', year: 1991 } } }, | ||
{ create: { _id: 2, data: { title: 'Beauty and the Beast 2', year: 2030 } } }, | ||
{ create: { _id: 1, data: { title: 'Beauty and the Beast 3', year: 2031 } } }, // document already exists error | ||
{ create: { _id: 2, data: { title: 'Beauty and the Beast 4', year: 2049 } } } // document already exists error | ||
] | ||
}).then((response) => { | ||
response.body.forEach((item) => { | ||
const createStatus = item.create && item.create.status; | ||
if (createStatus && !createStatus.toString().match(/2\d{2}/)) { | ||
console.log(item.create.error.reason); | ||
} | ||
}); | ||
}).catch((error) => { | ||
console.error(error); | ||
}); | ||
``` | ||
|
||
## Cleanup | ||
To clean up the resources created in this guide, delete the `movies` and `books` indices: | ||
|
||
```javascript | ||
client.indices.delete({ index: [movies, books] }); | ||
``` |