https://github.com/vanyasem/vk-scraper

Scrape VK media
https://github.com/vanyasem/vk-scraper

api downloader python scrape scraper vk vk-api vkontakte vkontakte-api

Last synced: 9 months ago
JSON representation

Scrape VK media

Host: GitHub
URL: https://github.com/vanyasem/vk-scraper
Owner: vanyasem
License: gpl-3.0
Created: 2017-06-24T16:44:59.000Z (about 9 years ago)
Default Branch: master
Last Pushed: 2023-09-26T17:14:52.000Z (almost 3 years ago)
Last Synced: 2025-02-07T18:04:43.742Z (over 1 year ago)
Topics: api, downloader, python, scrape, scraper, vk, vk-api, vkontakte, vkontakte-api
Language: Python
Homepage: https://pypi.org/project/VK-Scraper/
Size: 76.2 KB
Stars: 57
Watchers: 7
Forks: 17
Open Issues: 6
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

VK Scraper
==========
![Python 3.5, 3.6, 3.7](https://img.shields.io/pypi/pyversions/vk_scraper.svg)
[![PyPI](https://img.shields.io/pypi/v/vk-scraper.svg)](https://pypi.python.org/pypi/VK-Scraper)
[![AUR](https://repology.org/badge/version-for-repo/aur/vk-scraper.svg)](https://aur.archlinux.org/packages/vk-scraper)

vk-scraper is a command-line application written in Python that scrapes and downloads VK user's / community's data. Use responsibly.

To get a better understanding of how it works, head to [the docs](DOCS.md).

Features
--------
- Scrape photos
- Scrape videos (both uploaded and external)
- Scrape saved photos
- Scrape stories

Install
-------

#### Arch GNU/Linux
For the **stable** version ([vk-scraper](https://aur.archlinux.org/packages/vk-scraper)):

git clone https://aur.archlinux.org/vk-scraper.git vk-scraper

For the **git** version ([vk-scraper-git](https://aur.archlinux.org/packages/vk-scraper-git)):

git clone https://aur.archlinux.org/vk-scraper-git.git vk-scraper

Then build & install:

cd vk-scraper
makepkg -sic

Or [use an AUR helper](https://wiki.archlinux.org/title/AUR_helpers) of your choice.

#### Other distros
For the **stable** version:

$ pip3 install vk-scraper --upgrade --user

For the **git** version:

$ pip3 install git+https://github.com/vanyasem/VK-Scraper.git --upgrade --user

Usage
-----
To scrape media:
```bash
vk-scraper -u -p
```
*By default, downloaded media will be placed in `/`.*

To specify multiple users/communities, pass a comma separated list of users:
```bash
vk-scraper username1,community1,username2,username3,community2
```

You can also supply a file containing a list of users/communities:
```bash
vk-scraper -f scrape_list.txt
```

```
$ cat vk_users.txt
username1
community1
username2
username3
community2
...
```
*Usernames may be separated by newlines, commas, semicolons, or whitespace.*

Arguments
---------
```
--help -h Show help message and exit

--login-user -u Your VK username

--login-pass -p Your VK password

--filename -f Path to a file containing a list of users/communities to scrape

--destination -d Specify destination folder. By default, media will
be downloaded to /

--retain-username -n Creates a subdirectory for each scraped name when the flag is set

--media-types -t Specify media types to scrape. Enter as space separated values.
Valid values are image, saved, video, story, wall, or none
(defaults to image)

--latest Scrape only new media since the last scrape. Uses the last modified
time of the latest media item in the destination directory for comparasion

--quiet -q Be quiet while scraping

--maximum -m Maximum number of items to scrape

--offset -o Offset from which the scrape starts. 0 is from the oldest. (Defaults to 0)
```

Contribution
------------
1. Check open issues, or open a new one to start a discussion around
your idea or a bug you found
2. Fork the repository and make your changes
3. Send a pull request

Futurelog
---------
- Scrape by hashtag
- Scrape by location
- Save metadata to a file (likes, comments, etc)
- Sort photos by their albums

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/vanyasem/vk-scraper

Awesome Lists containing this project

README