Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/molekilla/scalaregpubrobot
Scala Robot to download Sa companies from Registro Publico de Panama
https://github.com/molekilla/scalaregpubrobot
Last synced: 3 days ago
JSON representation
Scala Robot to download Sa companies from Registro Publico de Panama
- Host: GitHub
- URL: https://github.com/molekilla/scalaregpubrobot
- Owner: molekilla
- Created: 2012-03-08T16:13:21.000Z (over 12 years ago)
- Default Branch: master
- Last Pushed: 2012-03-27T22:35:42.000Z (over 12 years ago)
- Last Synced: 2024-04-10T02:56:47.005Z (7 months ago)
- Language: Scala
- Homepage:
- Size: 227 KB
- Stars: 4
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README
Awesome Lists containing this project
README
Scala RegPub Robot
======================================This is a web parser for Registro Publico de Panama to look for companies registered (useful for any newspaper).
At a glance
----------------------
*Using Akka 2.0 in Master/Worker classs
*Pimp My Library in JSoupExtensions
*Cake Pattern, which I'm not sure is the correct implementation, for the Storage trait
*jSoup for HTML parsing
*Have a trait for either ScalaJ or Dispatch. Using ScalaJ because Dispatch requires to set the correct charset
*MongoDB for storage
*Java wrapper for creting runnable jarTake note that this is not using SBT , which I tried but is failing to work properly.
You will need these libraries
*Akka 2.0
*Casbah commons, core and query
*Commons logging
*Dispatch core and http
*Http Client and Core (Dispatch dependencies)
*Joda Time (MongoDB dependency)
*jSoup
*MongoDB 2.7.3 and Driver
*ScalaJ collection and http
*SF4LJ (MongoDB dependency)