How to grab multiple URLs/links out of one email message?

gcomm

Client
Registered
01.03.2011
Posts
332
Thanks
93
Points
28
What if there are two or more links within one email message? How do I grab all of them?


Let's say the regex is the same for all the mails in this scenario, to keep it simple. Or, if possible, please show how to run different regex checks and pulls within the one email message. Tons of thanks to the responder ;-)
 

gemini

Client
Registered
10.03.2011
Posts
160
Thanks
31
Points
28
Add a second mail verification step with a different regex for the links (and the same regex for the verification). I haven't tested it, but it should work.
 

gcomm

No, that doesn't work. I am writing the results to a text file with a simple regex, http:\/\/[\w\W]*, which starts at the first link and dumps everything up to the last link, so practically the entire message ends up in the text file. Not very practical. Then I try to parse the links out of that file using: {-RegExp.RegExp-|-{-FieldData.FieldData-|-●3●0●7●5●3●4●-|-≡1≡8≡7≡0≡6≡8≡0≡8≡0≡9≡-}-|-http:\/\/.*)-|-0-}

An empty line is passed to the next step, so this approach is missing something as well.
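
To illustrate why http:\/\/[\w\W]* dumps nearly the whole message, here is a minimal sketch in Python. It uses Python's `re` module purely as a stand-in, so the tool discussed in this thread may behave differently, and the message body is made up for the example. It also shows `\S+` as one possible alternative that stops at whitespace.

```python
import re

# Hypothetical message body, made up for illustration.
body = (
    "Hello,\n"
    "first link: http://example.com/a?x=1\n"
    "some text in between\n"
    "second link: http://example.org/b\n"
    "Bye\n"
)

# [\w\W] matches any character, newlines included, so the greedy *
# swallows everything from the first "http://" to the end of the input.
greedy = re.search(r"http:\/\/[\w\W]*", body).group(0)

# \S+ stops at the first whitespace character, so each match is one link,
# and findall returns every non-overlapping match, not only the first.
links = re.findall(r"http:\/\/\S+", body)

print(repr(greedy))
print(links)
```

With a pattern that ends at whitespace, each link comes back as its own match, which avoids the second parsing pass over a text file.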
====================================================


2nd method: put 4 separate regexes in the process mail step, which returns an array. How do I process this array?

Result: <reg0>xxx.xxxx.com/x</reg2><reg1>xxx.shit.poo</reg2><reg3><reg1>xxx.shit.poo</reg3>

This seems to be the best method:
[Attachment: asssssssssssssssss.JPG]
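
If the result string has to be processed outside the tool, one possible approach is to split the `<regN>` values into a list. The sketch below is in Python and is only an assumption about the format, based on the single result line quoted above. Because the opening and closing tag numbers in that line do not agree, the pattern deliberately does not require them to match.

```python
import re

# Result string copied from the post above (the tags are mismatched in the original).
result = "<reg0>xxx.xxxx.com/x</reg2><reg1>xxx.shit.poo</reg2><reg3><reg1>xxx.shit.poo</reg3>"

# Take the text between any opening <regN> tag and the next closing </regN> tag,
# without requiring the two numbers to agree. Empty entries are skipped.
values = re.findall(r"<reg\d+>([^<]+)</reg\d+>", result)

# Drop duplicates while keeping first-seen order.
unique = list(dict.fromkeys(values))

for index, value in enumerate(unique):
    print(index, value)
```

Each entry of `unique` could then be fed to the next step one at a time.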
 

gcomm

bump: email seems to be a hot topic tonight
 

gcomm

last bump-
 

joma

Client
Registered
06.03.2011
Posts
107
Thanks
13
Points
0
You can create as many regexes as you want for one email module.
To do that, just keep hitting the + sign to add several expressions, one per line.
You will get the results automatically; you just need to select the appropriate one in the selection box and use that result in your next steps.
 

gcomm

Using this regex in the Regex Builder returns all URLs:
(http|ftp|https):\/\/[\w\-_]+(\.[\w\-_]+)+([\w\-\.,@?^=%&:/~\+#]*[\w\-\@?^=%&/~\+#])?
So does this one:
(http|ftp|https):\/\/[\w\-_]+(\.[\w\-_]+)+([\w\-\.,@?^=%&:/~\+#]*[\w\-\@?^=%&/~\+#]).*?
and a few other simple variations.

However, the Email Processing area returns only one URL (the first one available).

Adding expressions with the + method, as joma suggested, only returns the first result that many times, so I receive the same link X times.

====================================

Alternatively, appending the entire contents of the email (text/HTML) from the first URL to the last with http:\/\/[\w\W]* and then parsing the text file itself returns ZERO data. WTF

Parsing the DOM for URLs is no problem, so what is going wrong here? Any insight?
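
The "only the first URL" behaviour looks like the difference between a first-match search and an all-matches search. Here is a minimal sketch of that difference in Python, using the first regex from this post with `&amp;` read as a plain `&`. The message text is made up, and Python's `re` is only a stand-in, so this does not describe how the Email Processing step works internally.

```python
import re

# First regex from this post, with the HTML entity "&amp;" read as "&".
URL_RE = re.compile(
    r"(http|ftp|https):\/\/[\w\-_]+(\.[\w\-_]+)+"
    r"([\w\-\.,@?^=%&:/~\+#]*[\w\-\@?^=%&/~\+#])?"
)

# Hypothetical email text, made up for illustration.
text = (
    "Confirm: http://example.com/confirm?id=1&key=abc and then "
    "visit https://sub.example.org/path/page.html. Thanks"
)

# search() stops at the first match, which mirrors getting one URL only.
first = URL_RE.search(text).group(0)

# finditer() walks over every match; group(0) is the whole URL.
# (findall() would return tuples here because the pattern has capture groups.)
all_urls = [match.group(0) for match in URL_RE.finditer(text)]

print(first)
for url in all_urls:
    print(url)
```

If the email step only exposes first-match behaviour, repeating the same expression would plausibly return the same link each time, which matches what is described above.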

 
