$ sudo apt install httrack
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
The following packages were automatically installed and are no longer required:
aspnetcore-runtime-6.0 aspnetcore-targeting-pack-6.0
dotnet-apphost-pack-6.0 dotnet-host dotnet-hostfxr-6.0
dotnet-runtime-6.0 dotnet-runtime-deps-6.0 dotnet-sdk-6.0
dotnet-targeting-pack-6.0 libopusfile0 libportmidi0
libsdl2-image-2.0-0 libsdl2-mixer-2.0-0 libsdl2-ttf-2.0-0
netstandard-targeting-pack-2.1 python3-altgraph python3-docopt
python3-donut python3-dropbox python3-humanize python3-jose
python3-jq python3-macholib python3-pydispatch python3-pygame
python3-pyinstaller python3-pyminifier python3-pymysql python3-pyvnc
python3-rsa python3-secretsocks python3-sqlalchemy-utc python3-stone
python3-websocket python3-websockify python3-xlrd python3-xlutils
python3-xlwt python3-zlib-wrapper starkiller
Use 'sudo apt autoremove' to remove them.
The following additional packages will be installed:
libhttrack2
Suggested packages:
webhttrack httrack-doc
The following NEW packages will be installed:
httrack libhttrack2
0 upgraded, 2 newly installed, 0 to remove and 1007 not upgraded.
Need to get 297 kB of archives.
After this operation, 814 kB of additional disk space will be used.
Do you want to continue? [Y/n] Y
Get:1 http://kali.cs.nycu.edu.tw/kali kali-rolling/main amd64 libhttrack2 amd64 3.49.4-1 [264 kB]
Get:2 http://kali.cs.nycu.edu.tw/kali kali-rolling/main amd64 httrack amd64 3.49.4-1 [32.9 kB]
Fetched 297 kB in 1s (272 kB/s)
Selecting previously unselected package libhttrack2:amd64.
(Reading database ... 412002 files and directories currently installed.)
Preparing to unpack .../libhttrack2_3.49.4-1_amd64.deb ...
Unpacking libhttrack2:amd64 (3.49.4-1) ...
Selecting previously unselected package httrack.
Preparing to unpack .../httrack_3.49.4-1_amd64.deb ...
Unpacking httrack (3.49.4-1) ...
Setting up libhttrack2:amd64 (3.49.4-1) ...
Setting up httrack (3.49.4-1) ...
Processing triggers for libc-bin (2.37-6) ...
Processing triggers for man-db (2.11.2-3) ...
Processing triggers for kali-menu (2023.4.3) ...
$ httrack
Welcome to HTTrack Website Copier (Offline Browser) 3.49-4+libhtsjava.so.2
Copyright (C) 1998-2017 Xavier Roche and other contributors
To see the option list, enter a blank line or try httrack --help
Enter project name :fake_yes24
Base path (return=/home/kali/websites/) :
Enter URLs (separated by commas or blank spaces) :https://www.yes24.com/main/default.aspx
Action:
(enter) 1 Mirror Web Site(s)
2 Mirror Web Site(s) with Wizard
3 Just Get Files Indicated
4 Mirror ALL links in URLs (Multiple Mirror)
5 Test Links In URLs (Bookmark Test)
0 Quit
: 2
Proxy (return=none) :
You can define wildcards, like: -*.gif +www.*.com/*.zip -*img_*.zip
Wildcards (return=none) :
You can define additional options, such as recurse level (-r<number>), separated by blank spaces
To see the option list, type help
Additional options (return=none) :
---> Wizard command line: httrack https://www.yes24.com/main/default.aspx -W -O "/home/kali/fake_aladin" -%v
Ready to launch the mirror? (Y/n) :Y
Mirror launched on Wed, 08 Nov 2023 03:18:10 by HTTrack Website Copier/3.49-4+libhtsjava.so.2 [XR&CO'2014]
mirroring https://www.yes24.com/main/default.aspx with the wizard help..
Done.
Thanks for using HTTrack!
*
# 형식) sudo mv [기존경로] [새 경로]
# 기존경로와 새 경로 사이에 반드시 띄어쓰기
# websites 폴더에 있는 fake_yes24를 /var/www/html/ 경로로 이동
sudo mv /home/kali/websites/fake_yes24 /var/www/html/
# 서버 구동
sudo service apache2 start
# 서버 상태 확인
$ sudo service apache2 status
[sudo] password for kali:
● apache2.service - The Apache HTTP Server
Loaded: loaded (/lib/systemd/system/apache2.service; enabled; pres>
Active: active (running) since Wed 2023-11-08 01:50:43 EST; 1h 59m>
Docs: https://httpd.apache.org/docs/2.4/
Process: 161761 ExecStart=/usr/sbin/apachectl start (code=exited, s>
Main PID: 161766 (apache2)
Tasks: 10 (limit: 2265)
Memory: 21.4M
CPU: 612ms
CGroup: /system.slice/apache2.service
├─161766 /usr/sbin/apache2 -k start
├─161769 /usr/sbin/apache2 -k start
├─161770 /usr/sbin/apache2 -k start
├─161771 /usr/sbin/apache2 -k start
├─161772 /usr/sbin/apache2 -k start
├─161773 /usr/sbin/apache2 -k start
├─162031 /usr/sbin/apache2 -k start
├─162523 /usr/sbin/apache2 -k start
├─162532 /usr/sbin/apache2 -k start
└─162533 /usr/sbin/apache2 -k start
Nov 08 01:50:42 kali systemd[1]: Starting apache2.service - The Apache >
Nov 08 01:50:43 kali apachectl[161765]: AH00558: apache2: Could not rel>
Nov 08 01:50:43 kali systemd[1]: Started apache2.service - The Apache H>
lines 1-24/24 (END)
주소창에 http://127.0.0.1/복제한_프로젝트명
를 입력해주면 카피한 사이트를 확인할 수 있다.
보안이 강화된 사이트의 경우 크롤링 후 페이지가 로딩되지 않을 수도 있다.