What happens if the index key is repeated when using array_column? How to deal with duplicate index keys?

M66 2025-04-28

In PHP, array_column() is a very common function that allows extracting data from a specified column from a multidimensional array and returning it. This function is especially suitable for processing database query results or arrays similar to structured data. By specifying the column's key, array_column() can easily extract all data from an array from an array.

However, many developers may encounter a problem: What happens if the index keys of an array are repeated when using array_column() ? In this article, we will explore this problem and provide solutions.

Introduction to array_column()

The basic usage of the array_column() function is as follows:

 array_column(array $input, $column_key, $index_key = null): array

$input : The input two-dimensional array.
$column_key : The key name of the column to be extracted or the index of the column.
$index_key : optional. You can specify the index key that returns the array.

This function extracts data based on the given column key $column_key and returns a new array. If $index_key is specified, the returned array will use this key as index.

For example:

 $data = [
    ['id' => 1, 'name' => 'Alice', 'age' => 23],
    ['id' => 2, 'name' => 'Bob', 'age' => 24],
    ['id' => 3, 'name' => 'Charlie', 'age' => 25],
];

$names = array_column($data, 'name'); // ['Alice', 'Bob', 'Charlie']

The situation of duplication of index keys

When the $index_key parameter is specified, array_column() returns a new array using the $index_key value as the index. If the $index_key key in the input array is repeated, the subsequent value overwrites the previous value. This situation often confuses developers because some data may be lost.

Let’s take a look at an example:

 $data = [
    ['id' => 1, 'name' => 'Alice', 'age' => 23],
    ['id' => 2, 'name' => 'Bob', 'age' => 24],
    ['id' => 1, 'name' => 'Charlie', 'age' => 25],  // id repeat
];

$result = array_column($data, 'name', 'id');

print_r($result);

Output:

 Array
(
    [1] => Charlie
    [2] => Bob
)

In the above code, due to the duplication of the id key ( id value is 1), only "Charlie" is kept in the returned array, and the original "Alice" value is overwritten. This is how array_column() behaves when encountering duplicate indexes.

How to deal with duplicate index keys?

In some cases, we may want to keep all duplicate indexes instead of overwriting them. At this point, we can use some tricks to deal with this situation:

1. Reprocess the duplicate index after using array_column()

An easy way is to manually process the duplicate index after the array_column() function returns the result. For example, we can use array_merge() to merge values with the same index into an array.

 $data = [
    ['id' => 1, 'name' => 'Alice', 'age' => 23],
    ['id' => 2, 'name' => 'Bob', 'age' => 24],
    ['id' => 1, 'name' => 'Charlie', 'age' => 25],  // id repeat
];

$column = array_column($data, 'name', 'id');

// 处理repeat索引，确保所有repeat的值被保留
$column = array_map(function($v) {
    return is_array($v) ? $v : [$v];
}, $column);

print_r($column);

Output:

 Array
(
    [1] => Array
        (
            [0] => Alice
            [1] => Charlie
        )

    [2] => Array
        (
            [0] => Bob
        )
)

In this example, we use array_map() to convert each value into an array, which ensures that all values are preserved.

2. Use array_merge to merge duplicate values

If we want to merge all the duplicate values into an array, we can also do this by doing manipulation on the array. For example, the following code uses array_merge to merge duplicate indexes:

 $data = [
    ['id' => 1, 'name' => 'Alice', 'age' => 23],
    ['id' => 2, 'name' => 'Bob', 'age' => 24],
    ['id' => 1, 'name' => 'Charlie', 'age' => 25],  // id repeat
];

$result = [];
foreach ($data as $item) {
    $result[$item['id']][] = $item['name'];
}

print_r($result);

Output:

 Array
(
    [1] => Array
        (
            [0] => Alice
            [1] => Charlie
        )

    [2] => Array
        (
            [0] => Bob
        )
)

In this way, we can manually control how to handle duplicate indexes without losing any data.

Summarize

array_column() is a powerful tool that can help us extract data from a specified column from an array. However, when there are duplicate index keys in the array, the subsequent values overwrite the previous values, which can lead to data loss. To handle this situation, we can take different approaches, such as merging duplicate values or using an array function to further process the result. Depending on specific needs, choosing the right strategy to solve the problem of duplicate index keys can make the code more robust and reliable.