preg_match_all - refining a regular expression

nelsoniv · #1 · 30/04/2012, 18:38

Hi everyone... after quite a while I'm turning to you again. I've searched and read a lot and I'm still stuck on this problem.

I'm trying to extract the links from a feed with preg_match_all (I couldn't parse it with simplexml_load_file because the feed is too messy), and I only need the links that start with a given URL.

Example:
http://juanito.perez.com/solucion/ayuda?1.php
http://juanito.perez.com/consultas/consultas?44.php
http://otraweb.com/problema/ayuda?23.php

I want to extract only the URLs that start with "http://juanito.perez.com/solucion/"... in this case the output would be: http://juanito.perez.com/solucion/ayuda?1.php

I know I need to refine the pattern in the script below, but I haven't found the right one yet.

PHP code:

<?php

// Retrieve link destinations (the href values)
function get_a_href($file) {
    $h1count = preg_match_all('/(href=")(.*?)(")/i', $file, $patterns); // this pattern needs refining
    return $patterns[2];
}

$file = file_get_contents("http://juanito.perez.com/");
$x = get_a_href($file);
print_r($x);

?>

Any help, please.
Thanks!!

JorM · #2 · 26/12/2012, 14:02

Quote:

Example:
http://juanito.perez.com/solucion/ayuda?1.php
http://juanito.perez.com/consultas/consultas?44.php
http://otraweb.com/problema/ayuda?23.php

I want to extract only the URLs that start with "http://juanito.perez.com/solucion/"... in this case the output would be: http://juanito.perez.com/solucion/ayuda?1.php

If you only want to extract the links that start with "http://juanito.perez.com/solucion/", it could be done like this:

PHP code:

<?php
// Sample input: one URL per line
$str = '
http://juanito.perez.com/solucion/ayuda?1.php
http://juanito.perez.com/consultas/consultas?44.php
http://otraweb.com/problema/ayuda?23.php
http://juanito.perez.com/solucion/ejemplo.php
http://juanito.perez.com/solucion/ejemplo2?var1=1&var2=2.php';

// Pattern: the fixed prefix, then any characters up to the last ".php" on the line.
// With "#" as the delimiter, the slashes do not need escaping.
$expr = '#http://juanito\.perez\.com/solucion/.+\.php#';
$mCount = preg_match_all($expr, $str, $arrMatches);
foreach ($arrMatches[0] as $value) {
    echo $value . '<br/>';
}
?>

I've tested it and it works.
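
For the case in the first post, where the links sit inside href attributes of a downloaded page, something along these lines might also work. It is only a sketch: the function name get_links_with_prefix and the sample $html string are made up for illustration, and it assumes the attributes use double quotes. preg_quote() escapes the dots in the prefix, and [^"]* stops at the closing quote, so several links on the same line are matched one by one.

```php
<?php
// Hypothetical helper (not from the thread): return the href values found in
// $html that start with $prefix.
function get_links_with_prefix(string $html, string $prefix): array
{
    // preg_quote() escapes regex metacharacters in the prefix (the dots, for
    // example); its second argument also escapes the "#" delimiter.
    // The "i" flag makes the whole match case-insensitive, prefix included.
    $pattern = '#href="(' . preg_quote($prefix, '#') . '[^"]*)"#i';

    // Group 1 holds the URL itself, without the surrounding href="...".
    preg_match_all($pattern, $html, $matches);

    return $matches[1];
}

// Made-up sample markup standing in for the downloaded feed.
$html = '<a href="http://juanito.perez.com/solucion/ayuda?1.php">a</a> '
      . '<a href="http://juanito.perez.com/consultas/consultas?44.php">b</a> '
      . '<a href="http://otraweb.com/problema/ayuda?23.php">c</a> '
      . '<a href="http://juanito.perez.com/solucion/ejemplo.php">d</a>';

$links = get_links_with_prefix($html, 'http://juanito.perez.com/solucion/');
print_r($links);
```

With the sample string above, $links should contain only the two URLs under /solucion/. For the real feed you would pass in the result of file_get_contents() instead of $html.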
Hope it helps!!
Regards!