This article explains how to use multiple PHP processes together with a Redis sorted set to deduplicate a large file. Interested readers can follow along.
1. Take a large file. For example, mine is:
-rw-r--r-- 1 ubuntu ubuntu 9.1G Mar 1 17:53 2018-12-awk-uniq.txt
2. Use the split command to cut it into 10 smaller files
split -b 1000m 2018-12-awk-uniq.txt
The -b option splits the file by size in bytes; the suffixes m and k are supported.
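For reference, split uses the default output prefix x and two-letter suffixes, so a 9.1G file cut into 1000m chunks should leave roughly ten files in the current directory, something like:
ls x*
xaa  xab  xac  xad  xae  xaf  xag  xah  xai  xaj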
3. Use 10 PHP processes to read the files and insert each line into a Redis sorted set (zset). A member that already exists in the set is not inserted again, so the sorted set does the deduplication.
<?php
$file = $argv[1];

// Daemonize the worker
umask(0);                      // clear the file mode creation mask
if (pcntl_fork() != 0) {       // parent process: exit
    exit();
}
posix_setsid();                // become session leader, detach from the terminal
if (pcntl_fork() != 0) {       // first child: exit
    exit();
}

$start = memory_get_usage();   // record memory usage at start

$redis = new Redis();
$redis->connect('127.0.0.1', 6379);

$handle = fopen("./{$file}", 'rb');
while (feof($handle) === false) {
    $line  = fgets($handle);
    $email = str_replace("\n", "", $line);
    // zAdd ignores members that already exist, so duplicates are dropped
    $redis->zAdd('emails', 1, $email);
}
fclose($handle);
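Assuming the chunks keep split's default names (xaa through xaj) and the worker script above is saved as dedup.php (a file name chosen here for illustration), the ten workers could be started with a simple loop. Because each worker daemonizes itself with the double fork, the loop returns immediately and all ten processes run in parallel:
for f in xa*; do php dedup.php "$f"; done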
4. View the acquired data in redis
Get the number of elements:
zcard emails
Get the elements in a given range, for example starting at 100000 and ending at 100100:
zrange emails 100000 100100 WITHSCORES
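To write the deduplicated data back to disk, the sorted set can be read out in batches with zRange. A minimal sketch, assuming the same phpredis extension used above; the output file name and batch size are arbitrary choices:
<?php
$redis = new Redis();
$redis->connect('127.0.0.1', 6379);

$out   = fopen('./emails-uniq.txt', 'wb');
$total = $redis->zCard('emails');   // number of unique members
$batch = 10000;

for ($offset = 0; $offset < $total; $offset += $batch) {
    // fetch one batch of members, without scores
    $members = $redis->zRange('emails', $offset, $offset + $batch - 1);
    foreach ($members as $email) {
        fwrite($out, $email . "\n");
    }
}
fclose($out);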