[SK shieldus Rookies Cohort 16][Kali Linux] HTTrack Lab: Cloning a Website

Jina · November 8, 2023

Installing the HTTrack package

$ sudo apt install httrack

Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
The following packages were automatically installed and are no longer required:
  aspnetcore-runtime-6.0 aspnetcore-targeting-pack-6.0
  dotnet-apphost-pack-6.0 dotnet-host dotnet-hostfxr-6.0
  dotnet-runtime-6.0 dotnet-runtime-deps-6.0 dotnet-sdk-6.0
  dotnet-targeting-pack-6.0 libopusfile0 libportmidi0
  libsdl2-image-2.0-0 libsdl2-mixer-2.0-0 libsdl2-ttf-2.0-0
  netstandard-targeting-pack-2.1 python3-altgraph python3-docopt
  python3-donut python3-dropbox python3-humanize python3-jose
  python3-jq python3-macholib python3-pydispatch python3-pygame
  python3-pyinstaller python3-pyminifier python3-pymysql python3-pyvnc
  python3-rsa python3-secretsocks python3-sqlalchemy-utc python3-stone
  python3-websocket python3-websockify python3-xlrd python3-xlutils
  python3-xlwt python3-zlib-wrapper starkiller
Use 'sudo apt autoremove' to remove them.
The following additional packages will be installed:
  libhttrack2
Suggested packages:
  webhttrack httrack-doc
The following NEW packages will be installed:
  httrack libhttrack2
0 upgraded, 2 newly installed, 0 to remove and 1007 not upgraded.
Need to get 297 kB of archives.
After this operation, 814 kB of additional disk space will be used.
Do you want to continue? [Y/n] Y
Get:1 http://kali.cs.nycu.edu.tw/kali kali-rolling/main amd64 libhttrack2 amd64 3.49.4-1 [264 kB]
Get:2 http://kali.cs.nycu.edu.tw/kali kali-rolling/main amd64 httrack amd64 3.49.4-1 [32.9 kB]
Fetched 297 kB in 1s (272 kB/s)    
Selecting previously unselected package libhttrack2:amd64.
(Reading database ... 412002 files and directories currently installed.)
Preparing to unpack .../libhttrack2_3.49.4-1_amd64.deb ...
Unpacking libhttrack2:amd64 (3.49.4-1) ...
Selecting previously unselected package httrack.
Preparing to unpack .../httrack_3.49.4-1_amd64.deb ...
Unpacking httrack (3.49.4-1) ...
Setting up libhttrack2:amd64 (3.49.4-1) ...
Setting up httrack (3.49.4-1) ...
Processing triggers for libc-bin (2.37-6) ...
Processing triggers for man-db (2.11.2-3) ...
Processing triggers for kali-menu (2023.4.3) ...
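
Before moving on, it can help to confirm that the `httrack` binary is actually on the `PATH`. The snippet below is only a small sketch; `have_cmd` is a hypothetical helper name, not something shipped with HTTrack or Kali.

```shell
# Hypothetical helper: succeeds if the given command is found on PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

if have_cmd httrack; then
    echo "httrack found at: $(command -v httrack)"
else
    echo "httrack not found; try again with: sudo apt install httrack"
fi
```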

Copying a site with HTTrack

$ httrack

Welcome to HTTrack Website Copier (Offline Browser) 3.49-4+libhtsjava.so.2
Copyright (C) 1998-2017 Xavier Roche and other contributors
To see the option list, enter a blank line or try httrack --help

Enter project name :fake_yes24

Base path (return=/home/kali/websites/) :

Enter URLs (separated by commas or blank spaces) :https://www.yes24.com/main/default.aspx

Action:
(enter) 1       Mirror Web Site(s)
        2       Mirror Web Site(s) with Wizard
        3       Just Get Files Indicated
        4       Mirror ALL links in URLs (Multiple Mirror)
        5       Test Links In URLs (Bookmark Test)
        0       Quit
: 2

Proxy (return=none) :

You can define wildcards, like: -*.gif +www.*.com/*.zip -*img_*.zip
Wildcards (return=none) :

You can define additional options, such as recurse level (-r<number>), separated by blank spaces
To see the option list, type help
Additional options (return=none) :

---> Wizard command line: httrack https://www.yes24.com/main/default.aspx -W -O "/home/kali/websites/fake_yes24"  -%v  

Ready to launch the mirror? (Y/n) :Y

Mirror launched on Wed, 08 Nov 2023 03:18:10 by HTTrack Website Copier/3.49-4+libhtsjava.so.2 [XR&CO'2014]
mirroring https://www.yes24.com/main/default.aspx with the wizard help..
Done.
Thanks for using HTTrack!
* 
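
The wizard prints the command line it is about to run, so the same mirror can presumably be started without answering the prompts one by one. The sketch below only assembles and prints that command; `mirror_cmd` is a hypothetical helper, and the optional third argument maps to the `-r<number>` recurse-level option mentioned in the prompt above. The output path assumes the default base path (`$HOME/websites/`) and the project name entered in this session.

```shell
# Hypothetical helper: print an httrack command line in the same shape
# as the one shown by the wizard.
#   $1 = start URL
#   $2 = output directory (base path + project name)
#   $3 = optional recurse level, passed as -r<number>
mirror_cmd() {
    cmd="httrack $1 -W -O \"$2\" -%v"
    if [ -n "${3:-}" ]; then
        cmd="$cmd -r$3"
    fi
    printf '%s\n' "$cmd"
}

# Same URL and project as in the interactive session.
mirror_cmd "https://www.yes24.com/main/default.aspx" "$HOME/websites/fake_yes24"

# Same mirror, limited to a recurse level of 2 (an example value).
mirror_cmd "https://www.yes24.com/main/default.aspx" "$HOME/websites/fake_yes24" 2
```

Printing the command first makes it easy to review the output directory before launching a mirror that may download many files.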

Moving the copied files to the DVWA folder

# Usage: sudo mv [source path] [destination path]
# The source and destination paths must be separated by a space

# Move fake_yes24 from the websites folder to /var/www/html/
sudo mv /home/kali/websites/fake_yes24 /var/www/html/
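
A plain `mv` does not protect against a wrong source path or an existing directory with the same name in the destination. As a rough sketch, the move could be wrapped in a function that checks both first; `move_mirror` is a hypothetical name, and the checks are an assumption about what is worth guarding against, not part of the original lab.

```shell
# Hypothetical helper: move a mirrored project directory into a web root.
#   $1 = source directory (e.g. /home/kali/websites/fake_yes24)
#   $2 = destination root (e.g. /var/www/html)
# Returns non-zero if the source is missing or the target already exists.
move_mirror() {
    src=$1
    dest_root=$2
    name=$(basename "$src")

    if [ ! -d "$src" ]; then
        echo "source directory not found: $src" >&2
        return 1
    fi
    if [ -e "$dest_root/$name" ]; then
        echo "target already exists: $dest_root/$name" >&2
        return 1
    fi
    mv "$src" "$dest_root/"
}
```

Writing to /var/www/html/ normally requires root, so a script containing this function would be run with `sudo`, for example calling `move_mirror /home/kali/websites/fake_yes24 /var/www/html`.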

Starting the apache2 server

# Start the server
sudo service apache2 start

# Check the server status
$ sudo service apache2 status

[sudo] password for kali: 
● apache2.service - The Apache HTTP Server
     Loaded: loaded (/lib/systemd/system/apache2.service; enabled; pres>
     Active: active (running) since Wed 2023-11-08 01:50:43 EST; 1h 59m>
       Docs: https://httpd.apache.org/docs/2.4/
    Process: 161761 ExecStart=/usr/sbin/apachectl start (code=exited, s>
   Main PID: 161766 (apache2)
      Tasks: 10 (limit: 2265)
     Memory: 21.4M
        CPU: 612ms
     CGroup: /system.slice/apache2.service
             ├─161766 /usr/sbin/apache2 -k start
             ├─161769 /usr/sbin/apache2 -k start
             ├─161770 /usr/sbin/apache2 -k start
             ├─161771 /usr/sbin/apache2 -k start
             ├─161772 /usr/sbin/apache2 -k start
             ├─161773 /usr/sbin/apache2 -k start
             ├─162031 /usr/sbin/apache2 -k start
             ├─162523 /usr/sbin/apache2 -k start
             ├─162532 /usr/sbin/apache2 -k start
             └─162533 /usr/sbin/apache2 -k start

Nov 08 01:50:42 kali systemd[1]: Starting apache2.service - The Apache >
Nov 08 01:50:43 kali apachectl[161765]: AH00558: apache2: Could not rel>
Nov 08 01:50:43 kali systemd[1]: Started apache2.service - The Apache H>
lines 1-24/24 (END)
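
The status output above opens in a pager (the `lines 1-24/24 (END)` line), which is closed with `q`. When only a yes/no answer is needed, `systemctl is-active` gives one without a pager. The sketch below assumes a systemd-based system, as the output above suggests; `apache_state` is a hypothetical helper name.

```shell
# Hypothetical helper: print "running" if the apache2 unit is active,
# otherwise "not running" (also when systemctl is unavailable).
apache_state() {
    if command -v systemctl >/dev/null 2>&1 && systemctl is-active --quiet apache2; then
        echo "running"
    else
        echo "not running"
    fi
}

apache_state
```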

Checking the site

Enter http://127.0.0.1/<project_name> in the browser's address bar to view the copied site. For this lab, that is http://127.0.0.1/fake_yes24.

For sites with stronger security measures, the copied pages may fail to load after crawling.
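
The same check can be done from the terminal. The sketch below builds the local URL from the project name and, if `curl` is installed, prints the HTTP status code returned by Apache; `mirror_url` is a hypothetical helper, and a `200` response only shows that the top-level page is served, not that every mirrored resource loads.

```shell
# Hypothetical helper: build the local URL for a mirrored project.
mirror_url() {
    printf 'http://127.0.0.1/%s/\n' "$1"
}

url=$(mirror_url "fake_yes24")
echo "$url"

if command -v curl >/dev/null 2>&1; then
    # Print only the HTTP status code for the mirrored site's top-level page.
    curl -s --max-time 5 -o /dev/null -w '%{http_code}\n' "$url" || echo "request failed"
fi
```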
